Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 410 |
| Since 2017 (last 10 years) | 913 |
| Since 2007 (last 20 years) | 1964 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Cason, Gerald J.; And Others – 1983
Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…
Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education
Hendrickson, Les – 1981
The study of the same people at several occasions is known by various names: panel studies, multi-wave, multi-variable models, cohort analysis, and gain score analysis. The use of econometric concepts, path analysis, and structural equation analysis to study multi-wave, multi-variable models became widespread in methodological practice in the late…
Descriptors: Achievement Gains, Cohort Analysis, Comparative Analysis, Elementary Education
Tatsuoka, Kikumi – 1980
This paper presents a new method for estimating a given latent trait variable by the least-squares approach. The beta weights are obtained recursively with the help of Fourier series and expressed as functions of item parameters of response curves. The values of the latent trait variable estimated by this method and by maximum likelihood method…
Descriptors: Computer Assisted Testing, Error of Measurement, Higher Education, Latent Trait Theory
Dunivant, Noel – 1979
Eight different methods are reviewed for determining whether two or more tests are equivalent measures. These methods vary in restrictiveness from the Wilks-Votaw test of compound symmetry (which requires that all means, variances, and covariances are equal), to Joreskog's theory of congeneric tests (which requires only that the tests are measures…
Descriptors: Analysis of Variance, Comparative Analysis, Error of Measurement, Evaluation Methods
Marco, Gary L. – 1968
Normative data were obtained on the performance of first-year graduate students on the Aptitude Test and Advanced Tests of the Graduate Record Examinations. The population consisted of students enrolled as full-time graduate students for the first time in the fall of 1964 in a college or university belonging to the Council of Graduate Schools…
Descriptors: Achievement Tests, Aptitude Tests, College Entrance Examinations, Error of Measurement
Frayer, Dorothy A. – 1971
A Paradigm for testing concept attainment, comprised of twelve tasks, was formulated. These tasks were hypothesized to form a cumulative hierarchy. Tests were constructed in mathematics and social studies using the paradigm. Data for these tests was analyzed by Kaiser's method for fitting a perfect simplex and Schonemann's method for fitting a…
Descriptors: Concept Formation, Correlation, Data Analysis, Error of Measurement
Marshall, J. Laird – 1976
A summary is provided of the rationale for questioning the applicability of classical reliability measures to criterion referenced tests; an extension of the classical theory of true and error scores to incorporate a theory of dichotomous decisions; a presentation of the mean split-half coefficient of agreement, a single-administration test index…
Descriptors: Career Development, Computer Programs, Criterion Referenced Tests, Decision Making
Peer reviewedKolen, Michael J.; Jarjoura, David – Psychometrika, 1987
A cubic spline method for smoothing equipercentile equating relationships under the common item nonequivalent populations design is described. Statistical techniques based on bootstrap estimation are presented for choosing an equating method/degree of smoothing. Smoothing decreases the estimate of random error but results in an increase in…
Descriptors: Analysis of Variance, Equated Scores, Error of Measurement, Estimation (Mathematics)
Peer reviewedHinde, Robert A.; Dennis, Amanda – International Journal of Behavioral Development, 1986
Argues that rank order correlations may not be ubiquitously suitable for assessing the relations between children's behavior or characteristics at one age or in one situation, and those shown later or in another context. (HOD)
Descriptors: Age Differences, Aggression, Analysis of Variance, Behavior Patterns
Peer reviewedBlasiak, Wladyslaw – Physics Education, 1983
Classifies errors as either systematic or blunder and uncertainties as either systematic or random. Discusses use of error/uncertainty analysis in direct/indirect measurement, describing the process of planning experiments to ensure lowest possible uncertainty. Also considers appropriate level of error analysis for high school physics students'…
Descriptors: Error of Measurement, Error Patterns, High Schools, Mathematics Skills
Lee, Guemin; Lewis, Daniel M. – 2001
The Bookmark Standard Setting Procedure (Lewis, Mitzel, and Green, 1996) is an item-response-theory-based standard setting method that has been widely implemented by state testing programs. The primary purposes of this study were to: (1) estimate standard errors for cutscores that result from Bookmark standard settings under a generalizability…
Descriptors: Cutting Scores, Elementary School Students, Elementary Secondary Education, Error of Measurement
Liu, Jinghua; Feigenbaum, Miriam; Cook, Linda – College Entrance Examination Board, 2004
This study explored possible configurations of the new SAT® critical reading section without analogy items. The item pool contained items from SAT verbal (SAT-V) sections of 14 previously administered SAT tests, calibrated using the three-parameter logistic IRT model. Multiple versions of several prototypes that do not contain analogy items were…
Descriptors: College Entrance Examinations, Critical Reading, Logical Thinking, Difficulty Level
Peer reviewedWhitely, Susan E. – Applied Psychological Measurement, 1979
Two sources of inconsistency were separated by reanalyzing data from a major study on short-term consistency. Little evidence was found for generalizability or behavioral predictability. Results supported the assumption that measurement error from short-term fluctuations is not due to systematic individual differences in response consistency.…
Descriptors: Behavior Change, Cognitive Processes, College Freshmen, Error of Measurement
Peer reviewedIsrael, Glenn D.; Taylor, C. L. – Evaluation and Program Planning, 1990
Mail questionnaire items that are susceptible to order effects were examined using data from 168 questionnaires in a Florida Cooperative Extension Service evaluation. Order effects were found for multiple-response and attributive questions but not for single-response items. Order also interacted with question complexity, social desirability, and…
Descriptors: Adult Farmer Education, Difficulty Level, Educational Assessment, Error of Measurement
Peer reviewedLennox, Richard D.; Dennis, Michael L. – Evaluation and Program Planning, 1994
Potential methods are explored for removing or otherwise controlling random measurement error, assessment artifacts, irrelevant variation in outcome measures, and confounding sources of covariation in a structural equations model. Using examples with measures of quality of life and functioning, the authors consider these methods for field…
Descriptors: Error of Measurement, Field Studies, Measurement Techniques, Models


