ERIC - Search Results

Publication Date

In 2026	0
Since 2025	59
Since 2022 (last 5 years)	416
Since 2017 (last 10 years)	919
Since 2007 (last 20 years)	1970

Descriptor

Error of Measurement	3316
Statistical Analysis	602
Scores	511
Item Response Theory	449
Correlation	434
Comparative Analysis	424
Foreign Countries	418
Test Reliability	412
Computation	407
Simulation	370
Reliability	357
Sample Size	355
Models	353
Evaluation Methods	350
Test Items	349
Measurement Techniques	318
Factor Analysis	311
Sampling	301
Statistical Bias	300
Research Methodology	288
Goodness of Fit	260
Psychometrics	260
Monte Carlo Methods	259
Regression (Statistics)	246
Mathematical Models	241
More ▼

Author

Raykov, Tenko	23
Brennan, Robert L.	19
Kolen, Michael J.	19
Lord, Frederic M.	17
Thompson, Bruce	16
Zimmerman, Donald W.	16
Lee, Won-Chan	15
Livingston, Samuel A.	14
McCaffrey, Daniel F.	14
Yuan, Ke-Hai	14
van der Linden, Wim J.	14
Cai, Li	13
Moses, Tim	13
Beretvas, S. Natasha	12
Marsh, Herbert W.	12
Zwick, Rebecca	12
Algina, James	11
Ferron, John M.	11
Lee, Guemin	11
Lockwood, J. R.	11
Marcoulides, George A.	11
Reardon, Sean F.	11
DeMars, Christine E.	10
Henson, Robin K.	10
More ▼

Education Level

Higher Education	271
Secondary Education	201
Postsecondary Education	197
Elementary Education	194
Elementary Secondary Education	126
Middle Schools	98
High Schools	82
Junior High Schools	78
Early Childhood Education	61
Grade 4	48
Intermediate Grades	44
Primary Education	42
Grade 8	40
Grade 3	39
Grade 5	39
Grade 7	33
Kindergarten	24
Adult Education	23
Grade 6	19
Grade 2	17
Preschool Education	16
Grade 1	15
Grade 10	12
Grade 9	12
Two Year Colleges	6
More ▼

Audience

Researchers	93
Practitioners	23
Teachers	22
Policymakers	10
Administrators	5
Students	4
Counselors	2
Parents	2
Community	1

Location

United States	47
Germany	42
Australia	34
Canada	27
Turkey	27
California	22
United Kingdom (England)	20
Netherlands	18
China	17
New York	15
United Kingdom	15
North Carolina	14
Texas	14
Italy	12
South Korea	12
Florida	11
Indonesia	11
New Zealand	11
Pennsylvania	11
Spain	11
Japan	10
Taiwan	10
Iran	9
Norway	9
Portugal	9
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	11
Race to the Top	6
Elementary and Secondary…	4
Aid to Families with…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Family Educational Rights and…	1
Guaranteed Student Loan…	1
Head Start	1
Individuals with Disabilities…	1
Job Training Partnership Act…	1
Strengthening Career and…	1
More ▼

What Works Clearinghouse Rating

Does not meet standards

Showing 1,471 to 1,485 of 3,316 results Save | Export

How Large Should a Statistical Sample Be?

Peer reviewed

Direct link

Menil, Violeta C.; Ye, Ruili – MathAMATYC Educator, 2012

This study serves as a teaching aid for teachers of introductory statistics. The aim of this study was limited to determining various sample sizes when estimating population proportion. Tables on sample sizes were generated using a C[superscript ++] program, which depends on population size, degree of precision or error level, and confidence…

Descriptors: Sample Size, Probability, Statistics, Sampling

Step Arounds for Common Pitfalls When Valuing Resources Used versus Resources Produced

Peer reviewed

Direct link

Yates, Brian T. – New Directions for Evaluation, 2012

The value of a program can be understood as referring not only to outcomes, but also to how those outcomes compare to the types and amounts of resources expended to produce the outcomes. Major potential mistakes and biases in assessing the worth of resources consumed, as well as the value of outcomes produced, are explored. Most of these occur…

Descriptors: Program Evaluation, Cost Effectiveness, Evaluation Criteria, Evaluation Problems

Rater Language Background as a Source of Measurement Error in the Testing of English Language Learners

Peer reviewed

Direct link

Kachchaf, Rachel; Solano-Flores, Guillermo – Applied Measurement in Education, 2012

We examined how rater language background affects the scoring of short-answer, open-ended test items in the assessment of English language learners (ELLs). Four native English and four native Spanish-speaking certified bilingual teachers scored 107 responses of fourth- and fifth-grade Spanish-speaking ELLs to mathematics items administered in…

Descriptors: Error of Measurement, English Language Learners, Scoring, Bilingual Teachers

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

International Test Score Comparisons and Educational Policy: A Review of the Critiques

Peer reviewed
PDF on ERIC

Download full text

Carnoy, Martin – National Education Policy Center, 2015

Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…

Descriptors: Criticism, Reputation, Test Validity, Error of Measurement

Formulation of a DIMTEST Effect Size Measure (DESM) and Evaluation of the DESM Estimator Bias

Peer reviewed

Direct link

Seo, Minhee; Roussos, Louis A. – Journal of Educational Measurement, 2010

DIMTEST is a widely used and studied method for testing the hypothesis of test unidimensionality as represented by local item independence. However, DIMTEST does not report the amount of multidimensionality that exists in data when rejecting its null. To provide more information regarding the degree to which data depart from unidimensionality, a…

Descriptors: Effect Size, Statistical Bias, Computation, Test Length

Assessing Short-Term Individual Consistency Using IRT-Based Statistics

Peer reviewed
PDF on ERIC

Download full text

Ferrando, Pere J. – Psicologica: International Journal of Methodology and Experimental Psychology, 2010

This article proposes a procedure, based on a global statistic, for assessing intra-individual consistency in a test-retest design with a short-term retest interval. The procedure is developed within the framework of parametric item response theory, and the statistic is a likelihood-based measure that can be considered as an extension of the…

Descriptors: Item Response Theory, Intervals, Psychometrics, Testing

Reporting Valid and Reliable Overall Scores and Domain Scores

Peer reviewed

Direct link

Yao, Lihua – Journal of Educational Measurement, 2010

In educational assessment, overall scores obtained by simply averaging a number of domain scores are sometimes reported. However, simply averaging the domain scores ignores the fact that different domains have different score points, that scores from those domains are related, and that at different score points the relationship between overall…

Descriptors: Educational Assessment, Error of Measurement, Item Response Theory, Scores

Qualification Users' Perceptions and Experiences of Assessment Reliability

Peer reviewed

Direct link

Chamberlain, Suzanne – Research Papers in Education, 2013

This paper presents the findings of a study designed to explore qualification users' perceptions and experiences of reliability in the context of national assessment outcomes in England. The study consisted of 17 focus groups conducted across six sectors of qualification users: students, teachers, trainee teachers, job-seekers, employers and…

Descriptors: Qualifications, Test Reliability, Foreign Countries, Focus Groups

Estimating Impacts of Treatment Random Assignment on Classroom Quality in the Head Start Impact Study: The Problem of Missing Data

Peer reviewed
PDF on ERIC

Download full text

Friedman-Krauss, Allison H.; Connors, Maia C.; Morris, Pamela A. – Society for Research on Educational Effectiveness, 2013

As a result of the 1998 reauthorization of Head Start, the Department of Health and Human Services conducted a national evaluation of the Head Start program. The goal of Head Start is to improve the school readiness skills of low-income children in the United States. There is a substantial body of experimental and correlational research that has…

Descriptors: Early Intervention, Preschool Education, School Readiness, Low Income Groups

IRR: A Blind Guide

Peer reviewed
PDF on ERIC

Download full text

Kierulff, Herbert – American Journal of Business Education, 2012

Over the past 60 years the internal rate of return (IRR) has become a major tool in investment evaluation. Many executives prefer it to net present value (NPV), presumably because they can more easily comprehend a percentage measure. This article demonstrates that, except in the rare case of an investment that is followed by a single cash return,…

Descriptors: Outcomes of Education, Measurement Techniques, Outcome Measures, Definitions

Measurement Invariance of Posttraumatic Stress Disorder Symptoms across Three Civilian Trauma Types

Direct link

Carter, Benjamin Hammond – ProQuest LLC, 2012

The factor structure of posttraumatic stress disorder (PTSD) remains the subject of intense investigation. The DSM three-factor conceptualization of PTSD has not been empirically supported; rather, two four-factor models of PTSD (King, Leskin, King, & Weathers, 1998; Simms, Watson, & Doebbeling, 2002) have garnered the majority of support…

Descriptors: Factor Structure, Posttraumatic Stress Disorder, Trauma, Symptoms (Individual Disorders)

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Direct link

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Factor Structure of the Revised TOEIC[R] Test: A Multiple-Sample Analysis

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2012

This study examined the factor structure of the listening and reading sections of the revised Test of English for International Communication (TOEIC[R]) test. The data from the TOEIC IP (institutional program) test taken by 569 English learners were randomly split into two samples (n = 285 vs. 284). Four models (higher-order, correlated,…

Descriptors: Communication (Thought Transfer), Second Language Learning, Factor Structure, Measurement

Fixing the c Parameter in the Three-Parameter Logistic Model

Peer reviewed
PDF on ERIC

Download full text

Han, Kyung T. – Practical Assessment, Research & Evaluation, 2012

For several decades, the "three-parameter logistic model" (3PLM) has been the dominant choice for practitioners in the field of educational measurement for modeling examinees' response data from multiple-choice (MC) items. Past studies, however, have pointed out that the c-parameter of 3PLM should not be interpreted as a guessing…

Descriptors: Statistical Analysis, Models, Multiple Choice Tests, Guessing (Tests)

« Previous Page | Next Page »

Pages: 1 | ... | 95 | 96 | 97 | 98 | 99 | 100 | 101 | 102 | 103 | ... | 222

Educational and Psychological…	259
Journal of Educational…	117
ProQuest LLC	95
Applied Psychological…	85
Journal of Educational and…	85
Psychometrika	82
Structural Equation Modeling:…	76
Grantee Submission	71
Journal of Experimental…	70
ETS Research Report Series	59
Multivariate Behavioral…	54
Applied Measurement in…	50
Sociological Methods &…	47
Journal of Psychoeducational…	38
Psychological Methods	33
Society for Research on…	33
Educational Measurement:…	32
Research Synthesis Methods	32
Online Submission	29
Practical Assessment,…	27
International Journal of…	26
Journal of Educational…	26
National Center for Education…	25
Psychology in the Schools	25
International Journal of…	23
More ▼

Journal Articles	2363
Reports - Research	1909
Reports - Evaluative	704
Reports - Descriptive	344
Speeches/Meeting Papers	329
Dissertations/Theses -…	95
Numerical/Quantitative Data	86
Opinion Papers	77
Information Analyses	72
Tests/Questionnaires	47
Guides - Non-Classroom	27
Guides - Classroom - Teacher	12
Book/Product Reviews	10
Reports - General	9
ERIC Publications	8
ERIC Digests in Full Text	7
Guides - General	7
Books	6
Guides - Classroom - Learner	4
Collected Works - General	3
Legal/Legislative/Regulatory…	3
Historical Materials	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Collected Works - Serials	1
More ▼

Program for International…	45
National Assessment of…	40
SAT (College Admission Test)	24
Trends in International…	24
ACT Assessment	20
Wechsler Intelligence Scale…	20
Early Childhood Longitudinal…	19
Wechsler Adult Intelligence…	12
Iowa Tests of Basic Skills	10
Schools and Staffing Survey…	10
Test of English as a Foreign…	9
Child Behavior Checklist	7
Graduate Record Examinations	7
National Longitudinal Survey…	7
Progress in International…	7
Beck Depression Inventory	6
Advanced Placement…	5
Armed Services Vocational…	5
Cognitive Abilities Test	5
Longitudinal Surveys of…	5
National Household Education…	5
Rosenberg Self Esteem Scale	5
Dynamic Indicators of Basic…	4
Law School Admission Test	4
Motivated Strategies for…	4
More ▼