Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Brown, William L.; And Others – 1996
This study presents psychometric characteristics of the mathematics problem solving performance assessment used in the Minneapolis Public Schools, focusing on the interrater reliability, scoring reliability, and validity of the assessment. The Minneapolis Math Problem Solving Assessment (MPSA) was established in 1991. Students are asked to solve…
Descriptors: Elementary School Students, Grade 5, Intermediate Grades, Interrater Reliability
Huang, Chi-yu; And Others – 1995
Generalizability theory is used to examine the sources of variability present in a teacher and course evaluation instrument. Two studies were conducted. In the first study, four different forms commonly used by one specific college of a large midwestern university were examined using responses of 915 students. The analysis of variance performed on…
Descriptors: Analysis of Variance, College Students, Course Evaluation, Evaluation Methods
Fago, George C. – 1995
When William G. Perry (1968) developed his scheme of nine stages of cognitive development, most of which are experienced during the college years, he did not attempt to quantify it. Subsequently, T. D. Erwin (1983) constructed a scale that attempted to quantify the Perry scheme. His findings supported the overall conception of student development…
Descriptors: Cognitive Development, Cognitive Processes, Concept Formation, Developmental Stages
Shorey, Leonard – 1991
Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…
Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries
Van der Linden, Wim J. – 1995
This paper addresses the problem of how to place students in a sequence of hierarchically related courses from an (empirical) Bayesian point of view. Based on a minimal set of assumptions, it is shown that optimal mastery rules for the courses are always monotone and a nonincreasing function of the scores on the placement test. On the other hand,…
Descriptors: Course Content, Course Objectives, Elementary Secondary Education, Foreign Countries
Leyva, Collette – 1997
The Test of Pragmatic Language (TOPL) is an individually administered instrument designed to assess pragmatic language skills that can be used with students in kindergarten through high school. It is more specifically intended for use with children, adolescents, and adults with learning disabilities, language delays, reading difficulties, or…
Descriptors: Adolescents, Adults, Children, Communication Skills
Daniel, Larry G.; Witta, E. Lea – 1997
Although reliability and validity are characteristics of test data, social scientists often attribute reliability and validity erroneously to the tests themselves. To determine the extent to which this problem exists, 150 reliability and validity studies selected from 3 prominent social science measurement journals over a 3-year period were…
Descriptors: Graduate Students, Graduate Study, Higher Education, Language Role
Rudner, Lawrence M. – 1996
In educational research and evaluation, a sample of subjects usually received some type of programmatic treatment. Outcome scores for these students are then compared with outcome scores of a control or comparison group. M. Lewis and H. McGurk (1972) have pointed out that there are some implicit assumptions when this approach is applied to…
Descriptors: Child Development, Cognitive Development, Early Childhood Education, Educational Research
Kaufman, Alan S.; And Others – 1994
The reliability and validity of three short forms of the Wechsler Intelligence Scale for Children III (WISC-III) were compared. Each of the short forms was a tetrad composed of two verbal and two performance subtests. The first tetrad was selected based primarily on practical considerations, particularly its brevity to administer and score. The…
Descriptors: Adolescents, Age Differences, Children, Clinical Diagnosis
Aycock, Tim – 1993
To determine trends in reporting test reliability, 88 articles addressing 188 instruments in 1980, 81 articles covering 205 instruments in 1985, and 67 articles assessing 195 instruments in 1990 in the "Journal of Counseling Psychology" were reviewed. Articles were examined for the way in which reliability was discussed and reported, and…
Descriptors: Educational Practices, Educational Research, Estimation (Mathematics), Interrater Reliability
Ackerman, Terry A.; Evans, John A. – 1992
The relationship between levels of reliability and the power of two bias and differential item functioning (DIF) detection methods is examined. Both methods, the Mantel-Haenszel (MH) procedure of P. W. Holland and D. T. Thayer (1988) and the Simultaneous Item Bias (SIB) procedure of R. Shealy and W. Stout (1991), use examinees' raw scores as a…
Descriptors: Comparative Analysis, Equations (Mathematics), Error of Measurement, Item Bias
Bartel, Kathleen – 1991
Literacy Volunteers of America (LVA) affiliates were surveyed regarding standardized and informal assessment devices they currently used and their frequency of use and effectiveness. A literature review focused on assessment tools and their limitations. The survey was sent to 39 LVA affiliates in Illinois, Indiana, Michigan, Missouri, Ohio, and…
Descriptors: Adult Basic Education, Adult Literacy, Evaluation Utilization, Informal Assessment
McNamara, T. F.; Adams, R. J. – 1991
A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…
Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability
Adams, Raymond J.; Khoo, Siek-Toon – 1993
The Quest program offers a comprehensive test and questionnaire analysis environment by providing a data analyst (a computer program) with access to the most recent developments in Rasch measurement theory, as well as a range of traditional analysis procedures. This manual helps the user use Quest to construct and validate variables based on…
Descriptors: Computer Assisted Testing, Computer Software, Estimation (Mathematics), Foreign Countries
MacPhee, David; Fritz, Janet J.; Miller-Heyl, Jan; Hite, Judy – 1998
Although mastery motivation appears to predict school success, individual assessment of mastery motivation is too time consuming and limits the application of this research. This study examined the psychometric properties of the Dimensions of Mastery Questionnaire (DMQ). The study focused on the validity of the measure for Head Start parents,…
Descriptors: Child Rearing, Construct Validity, Interpersonal Competence, Measurement Techniques


