ERIC - Search Results

Showing 2,836 to 2,850 of 3,311 results

The Fallibility of High Stakes "11-Plus" Testing in Northern Ireland

Peer reviewed

Gardner, John; Cowan, Pamela – Assessment in Education Principles Policy and Practice, 2005

This paper sets out the findings from a large-scale analysis of the Northern Ireland Transfer Procedure Tests, used to select pupils for grammar schools. As it was not possible to get completed test scripts from government agencies, over 3000 practice scripts were completed in simulated conditions and were analysed to establish whether the tests…

Descriptors: Foreign Countries, Educational Testing, Error of Measurement, Test Use

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

Mixed-Effects Logistic Regression Models for Indirectly Observed Discrete Outcome Variables

Peer reviewed

Vermunt, Jeroen K. – Multivariate Behavioral Research, 2005

A well-established approach to modeling clustered data introduces random effects in the model of interest. Mixed-effects logistic regression models can be used to predict discrete outcome variables when observations are correlated. An extension of the mixed-effects logistic regression model is presented in which the dependent variable is a latent…

Descriptors: Predictor Variables, Correlation, Maximum Likelihood Statistics, Error of Measurement

Graduate Student WAIS-III Scoring Accuracy Is a Function of Full Scale IQ and Complexity of Examiner Tasks

Peer reviewed

Hopwood, Christopher J.; Richard, David C. S. – Assessment, 2005

Research on the Wechsler Adult Intelligence Scale-Revised and Wechsler Adult Intelligence Scale-Third Edition (WAIS-III) suggests that practicing clinical psychologists and graduate students make item-level scoring errors that affect IQ, index, and subtest scores. Studies have been limited in that Full-Scale IQ (FSIQ) and examiner administration,…

Descriptors: Scoring, Psychologists, Intelligence Quotient, Graduate Students

The Children Born in 2001 at Kindergarten Entry: First Findings from the Kindergarten Data Collections of the Early Childhood Longitudinal Study, Birth Cohort (ECLS-B). First Look. NCES 2010-005

Peer reviewed

Flanagan, Kristin Denton; McPhee, Cameron – National Center for Education Statistics, 2009

Using data from the final two rounds of the Early Childhood Longitudinal Study, Birth Cohort (ECLS-B), a longitudinal study begun in 2001, this First Look provides a snapshot of the demographic characteristics, reading and mathematics knowledge, fine motor skills, school characteristics, and before- and after-school care arrangements of the cohort…

Descriptors: Child Development, Kindergarten, Longitudinal Studies, Cohort Analysis

A Case of the Inapplicability of the Rasch Model: Mapping Conceptual Learning

Peer reviewed

Stacey, Kaye; Steinle, Vicki – Mathematics Education Research Journal, 2006

The basic theory of Rasch measurement applies to situations where a person has a certain level of a trait being investigated, and this level of ability is what determines (to within a measurement error) how well the person does on each item in a test. This paper responds to frequent suggestions from colleagues that the use of Rasch measurement…

Descriptors: Measurement, Error of Measurement, Item Response Theory, Construct Validity

The Response in Response to Intervention: Evaluating the Utility of Assessing Maintenance of Intervention Effects

Peer reviewed

Ardoin, Scott P. – Psychology in the Schools, 2006

Extensive evidence exists demonstrating the utility of Curriculum-Based Measurement in reading (R-CBM) for progress-monitoring purposes; however, most studies have evaluated R-CBM from a traditional psychometric perspective, which allows for variability in individual student's data that is not a function of increased skills (i.e., measurement…

Descriptors: Psychometrics, Measurement, Maintenance, Intervention

A Formal Cognitive Model of the Go/No-Go Discrimination Task: Evaluation and Implications

Peer reviewed

Yechiam, Eldad; Goodnight, Jackson; Bates, John E.; Busemeyer, Jerome R.; Dodge, Kenneth A.; Pettit, Gregory S.; Newman, Joseph P. – Psychological Assessment, 2006

This article proposes and tests a formal cognitive model for the go/no-go discrimination task. In this task, the performer chooses whether to respond to stimuli and receives rewards for responding to certain stimuli and punishments for responding to others. Three cognitive models were evaluated on the basis of data from a longitudinal study…

Descriptors: Evaluation Research, Task Analysis, Adolescents, Longitudinal Studies

A Simulation Study of Methods for Assessing Differential Item Functioning in Computer-Adaptive Tests.

Zwick, Rebecca; And Others – 1993

Simulated data were used to investigate the performance of modified versions of the Mantel-Haenszel and standardization methods of differential item functioning (DIF) analysis in computer-adaptive tests (CATs). Each "examinee" received 25 items out of a 75-item pool. A three-parameter logistic item response model was assumed, and…

Descriptors: Adaptive Testing, Computer Assisted Testing, Correlation, Error of Measurement

Least Principal Components Analysis (LPCA): An Alternative to Regression Analysis.

Olson, Jeffery E. – 1992

Often, all of the variables in a model are latent, random, or subject to measurement error, or there is not an obvious dependent variable. When any of these conditions exist, an appropriate method for estimating the linear relationships among the variables is Least Principal Components Analysis. Least Principal Components are robust, consistent,…

Descriptors: Error of Measurement, Factor Analysis, Goodness of Fit, Mathematical Models

Reliability of Essay Rating and Score Adjustment. Program Statistics Research Technical Report No. 93-36.

Longford, Nicholas T. – 1993

A model-based approach to rater reliability for essays read by multiple readers is presented. Variation of rater severity (between-rater variation) and rater inconsistency (within-rater variation) is considered in the presence of between-examinee variation. An additive variance component model is posited and the method of moments for its…

Descriptors: Educational Diagnosis, Error of Measurement, Essays, Estimation (Mathematics)

Relative Performance of Rescaling and Resampling Approaches to Model Chi Square and Parameter Standard Error Estimation in Structural Equation Modeling.

Nevitt, Johnathan; Hancock, Gregory R. – 1998

Though common structural equation modeling (SEM) methods are predicated upon the assumption of multivariate normality, applied researchers often find themselves with data clearly violating this assumption and without sufficient sample size to use distribution-free estimation methods. Fortunately, promising alternatives are being integrated into…

Descriptors: Chi Square, Computer Software, Error of Measurement, Estimation (Mathematics)

Lies, Damn Lies, and Statistics Revisited: A Comparison of Three Methods of Representing Change. AIR 1991 Annual Forum Paper.

Pike, Gary R. – 1991

Because change is fundamental to education and the measurement of change assesses the quality and effectiveness of postsecondary education, this study examined three methods of measuring change: (1) gain scores; (2) residual scores; and (3) repeated measures. Data for the study was obtained from transcripts of 722 graduating seniors at the…

Descriptors: Academic Achievement, College Seniors, Error of Measurement, Higher Education

Experiences in the Application of Item Response Theory in Test Construction.

Green, Donald Ross; And Others – 1988

Potential benefits of using item response theory in test construction are evaluated, based on the experience and evidence accumulated during 9 years of using a three-parameter model in the construction of major achievement batteries. Specific benefits covered include obtaining sample-free item calibrations and item-free person measurement,…

Descriptors: Achievement Tests, Computer Assisted Testing, Difficulty Level, Elementary Secondary Education

What Do Ratings of Infant Temperament Really Measure?

MacPhee, David – 1983

As data on the reliability and validity of ratings of infant temperament have accumulated, researchers have begun to ask what caregiver ratings really measure. An argument has been made that ratings of social behavior are less a reflection of enduring individual differences than a measure of rater characteristics and error variance. This study…

Descriptors: Error of Measurement, Experimenter Characteristics, Infants, Knowledge Level
