Showing 106 to 120 of 3,310 results
Peer reviewed
Penaloza, Roberto V.; Berends, Mark – Sociological Methods & Research, 2022
To measure "treatment" effects, social science researchers typically rely on nonexperimental data. In education, school and teacher effects on students are often measured through value-added models (VAMs) that are not fully understood. We propose a framework that relates to the education production function in its most flexible form and…
Descriptors: Data, Value Added Models, Error of Measurement, Correlation
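For readers new to value-added models, the core two-step logic that such frameworks generalize can be sketched with simulated data. This is a minimal illustration in Python, not the authors' proposed framework: regress current scores on prior scores, then summarize teacher-level residuals.

    # Minimal value-added sketch: a teacher's "effect" is the mean residual
    # from a prior-score regression. Simulated data; real VAMs add controls.
    import numpy as np

    rng = np.random.default_rng(0)
    n_teachers, n_students = 20, 50
    teacher = np.repeat(np.arange(n_teachers), n_students)
    true_effect = rng.normal(0, 0.2, n_teachers)
    prior = rng.normal(0, 1, teacher.size)
    current = 0.7 * prior + true_effect[teacher] + rng.normal(0, 0.5, teacher.size)

    # Step 1: OLS of current on prior scores (with intercept).
    X = np.column_stack([np.ones_like(prior), prior])
    beta, *_ = np.linalg.lstsq(X, current, rcond=None)
    resid = current - X @ beta

    # Step 2: value-added estimate = mean residual per teacher.
    vam = np.array([resid[teacher == t].mean() for t in range(n_teachers)])
    print(np.corrcoef(vam, true_effect)[0, 1])  # how well true effects are recovered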
Peer reviewed
Levin, Joel R.; Ferron, John M.; Gafurov, Boris S. – Journal of Education for Students Placed at Risk, 2022
The present simulation study examined the statistical properties (namely, Type I error and statistical power) of various novel randomized single-case multiple-baseline designs and associated randomized-test analyses for comparing the A- to B-phase immediate abrupt outcome changes in two independent intervention conditions. It was found that with…
Descriptors: Statistical Analysis, Error of Measurement, Intervention, Program Effectiveness
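The randomization-test logic behind such analyses can be shown in a few lines. This is a hedged sketch for a single hypothetical series; the study's multiple-baseline designs combine several series and randomize across intervention conditions:

    # Single-case randomization test: the intervention start point was chosen
    # at random among permissible points, so the p-value is the rank of the
    # observed A-B mean difference within the distribution over all start points.
    import numpy as np

    y = np.array([2., 3, 2, 3, 2, 6, 7, 6, 7, 6, 7, 6])  # hypothetical outcome series
    actual_start = 5                                      # B phase actually began here
    candidates = range(3, 10)                             # permissible start points

    def ab_diff(start):
        return y[start:].mean() - y[:start].mean()

    observed = ab_diff(actual_start)
    dist = np.array([ab_diff(s) for s in candidates])
    print((dist >= observed).mean())  # one-sided randomization p-value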
Peer reviewed
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated, and the deflation may be profound: 0.40-0.60 units of reliability, or 46-71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
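One of the root sources the abstract enumerates, inefficiency in the reliability estimator, is easy to demonstrate: with unequal item loadings, coefficient alpha is only a lower bound on true reliability. A small simulation sketch under an assumed congeneric model with hypothetical loadings:

    # Alpha as a deflated estimate: for congeneric items (unequal loadings),
    # coefficient alpha underestimates the true reliability of the sum score.
    import numpy as np

    rng = np.random.default_rng(1)
    n, loadings = 5000, np.array([0.9, 0.7, 0.5, 0.4, 0.3])
    theta = rng.normal(size=n)
    X = theta[:, None] * loadings + rng.normal(size=(n, 5)) * np.sqrt(1 - loadings**2)

    k = loadings.size
    cov = np.cov(X, rowvar=False)
    alpha = k / (k - 1) * (1 - cov.trace() / cov.sum())

    # True reliability of the sum score under this model.
    true_rel = loadings.sum()**2 / (loadings.sum()**2 + (1 - loadings**2).sum())
    print(round(alpha, 3), round(true_rel, 3))  # alpha comes out lower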
Ryan Derickson – ProQuest LLC, 2022
Item Response Theory (IRT) models are a popular analytic method for self-report data. We show how traditional IRT models can be vulnerable to specific kinds of asymmetric measurement error (AME) in self-report data, because the models spread the error to all estimates -- even those of items that do not contribute error. We quantify the impact of…
Descriptors: Item Response Theory, Measurement Techniques, Error of Measurement, Models
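For context, models of this kind build on the standard item response function; below is a minimal two-parameter logistic (2PL) curve, a generic textbook formula rather than the dissertation's AME model:

    # 2PL item response function: probability of endorsing an item given trait
    # level theta, discrimination a, and difficulty b.
    import numpy as np

    def p_2pl(theta, a, b):
        return 1.0 / (1.0 + np.exp(-a * (theta - b)))

    theta = np.linspace(-3, 3, 7)
    print(p_2pl(theta, a=1.5, b=0.0))  # item characteristic curve values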
Peer reviewed
John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024
International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed
Stella Y. Kim; Carl Westine; Tong Wu; Derek Maher – Journal of College Student Retention: Research, Theory & Practice, 2024
The primary purpose of this study is to validate a student engagement measure for its use in evaluation of a learning assistant (LA) program. A series of psychometric evaluations were made for both the original scale of Higher Education Student Engagement Scale (HESES) and its adapted version designed to be used in gauging the effectiveness of…
Descriptors: Learner Engagement, Teaching Assistants, Test Validity, Test Reliability
Peer reviewed
Tülin Otbiçer Acar – Measurement: Interdisciplinary Research and Perspectives, 2024
The aim of this study is to compare reliability estimates based on the correlation coefficient with those obtained through the Bland-Altman plot technique. The scale was first divided into two halves using three different approaches. A linear, strong relationship was found between the scale scores obtained from the halved forms…
Descriptors: High School Students, Measurement Techniques, Psychometrics, Comparative Testing
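The two quantities being compared can be written down directly: the Spearman-Brown-corrected split-half correlation, and the Bland-Altman mean difference with 95% limits of agreement between the half scores. A sketch with hypothetical half-test scores:

    # Split-half reliability (Spearman-Brown) vs. Bland-Altman agreement
    # between two half-test scores. Hypothetical data.
    import numpy as np

    rng = np.random.default_rng(2)
    true = rng.normal(50, 10, 300)
    half1 = true + rng.normal(0, 4, 300)
    half2 = true + rng.normal(0, 4, 300)

    r = np.corrcoef(half1, half2)[0, 1]
    sb = 2 * r / (1 + r)                     # Spearman-Brown full-length reliability

    diff = half1 - half2
    bias = diff.mean()                       # Bland-Altman mean difference
    loa = (bias - 1.96 * diff.std(ddof=1), bias + 1.96 * diff.std(ddof=1))
    print(round(sb, 3), round(bias, 2), [round(x, 2) for x in loa])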
Peer reviewed
Nicolas Pichot; Boris Forthmann; Eric Bonetto; Thomas Arciszewski; Nathalie Bonnardel; Sara Jaubert; Jean B. Pavani – Journal of Creative Behavior, 2024
The term "creative" is commonly used in everyday language and in academic discourse to discuss the nature of artistic and innovative productions. This usage inherently implies the existence of a variable of creativity that allows different creative works to be compared. The standard definition of creativity asserts that a production must…
Descriptors: Creativity, Test Construction, Test Validity, Productive Thinking
Peer reviewed
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
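Statistics usually used for such two-way rescoring tables include percent agreement and Cohen's kappa; the paper's point is that their standard inference assumes one multinomial sample, while the rescoring design fixes the Time A margins. A minimal kappa computation on a hypothetical table:

    # Cohen's kappa from a Time A x Time B rescoring table. Because Time A
    # margins are fixed by design, the table is product-multinomial, which is
    # what undermines the usual inference for these statistics.
    import numpy as np

    table = np.array([[40.,  8,  2],   # rows: Time A scores 0-2
                      [ 6., 50,  9],   # cols: Time B scores 0-2
                      [ 1.,  7, 27]])
    n = table.sum()
    po = np.trace(table) / n                     # observed agreement
    pe = (table.sum(1) @ table.sum(0)) / n**2    # chance agreement from margins
    print(round(po, 3), round((po - pe) / (1 - pe), 3))  # agreement, kappa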
Peer reviewed
Daniel McNeish; Melissa G. Wolf – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Despite the popularity of traditional fit index cutoffs like RMSEA ≤ 0.06 and CFI ≥ 0.95, several studies have noted issues with overgeneralizing traditional cutoffs. Computational methods have been proposed to avoid overgeneralization by deriving cutoffs specifically tailored to the characteristics…
Descriptors: Structural Equation Models, Cutting Scores, Generalizability Theory, Error of Measurement
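For reference, the traditional indices those cutoffs attach to are simple functions of the model and baseline chi-square statistics. The sketch below uses the textbook formulas with hypothetical fit values; the paper's contribution is tailoring the cutoffs, not changing these formulas:

    # Textbook RMSEA and CFI computed from model and baseline chi-squares.
    import math

    def rmsea(chi2, df, n):
        return math.sqrt(max(0.0, (chi2 - df) / (df * (n - 1))))

    def cfi(chi2, df, chi2_base, df_base):
        d, d_base = max(chi2 - df, 0.0), max(chi2_base - df_base, 0.0)
        return 1.0 - d / max(d, d_base)

    # Hypothetical values: model chi2(40) = 85.2, baseline chi2(55) = 1450, N = 500.
    print(round(rmsea(85.2, 40, 500), 3), round(cfi(85.2, 40, 1450.0, 55), 3))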
Peer reviewed
Hyunjung Lee; Heining Cham – Educational and Psychological Measurement, 2024
Determining the number of factors in exploratory factor analysis (EFA) is crucial because it affects the rest of the analysis and the conclusions of the study. Researchers have developed various methods for deciding the number of factors to retain, but this remains one of the most difficult decisions in EFA. The purpose of this study is…
Descriptors: Factor Structure, Factor Analysis, Monte Carlo Methods, Goodness of Fit
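One canonical retention method in this literature is Horn's parallel analysis: retain factors whose observed eigenvalues exceed those of random data of the same dimensions. A sketch of the PCA-eigenvalue variant on simulated two-factor data (whether this exact variant is among those compared here is an assumption):

    # Horn's parallel analysis (PCA-eigenvalue variant): retain factors whose
    # observed eigenvalues exceed the 95th percentile of eigenvalues from
    # random normal data of the same n x p shape.
    import numpy as np

    def parallel_analysis(X, reps=200, pct=95, seed=0):
        rng = np.random.default_rng(seed)
        n, p = X.shape
        obs = np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))[::-1]
        sim = np.empty((reps, p))
        for r in range(reps):
            R = rng.normal(size=(n, p))
            sim[r] = np.linalg.eigvalsh(np.corrcoef(R, rowvar=False))[::-1]
        return int(np.sum(obs > np.percentile(sim, pct, axis=0)))

    rng = np.random.default_rng(3)
    F = rng.normal(size=(400, 2))                 # two true factors
    X = F @ rng.uniform(0.5, 0.8, (2, 8)) + rng.normal(0, 0.6, (400, 8))
    print(parallel_analysis(X))                   # expect 2 in this setup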
Peer reviewed
Suyoung Kim; Sooyong Lee; Jiwon Kim; Tiffany A. Whittaker – Structural Equation Modeling: A Multidisciplinary Journal, 2024
This study aims to address a gap in the social and behavioral sciences literature concerning interaction effects between latent factors in multiple-group analysis. By comparing two approaches for estimating latent interactions within multiple-group analysis frameworks using simulation studies and empirical data, we assess their relative merits.…
Descriptors: Social Science Research, Behavioral Sciences, Structural Equation Models, Statistical Analysis
Peer reviewed
Rebekka Kupffer; Susanne Frick; Eunike Wetzel – Educational and Psychological Measurement, 2024
The multidimensional forced-choice (MFC) format is an alternative to rating scales in which participants rank items according to how well the items describe them. Currently, little is known about how to detect careless responding in MFC data. The aim of this study was to adapt a number of indices used for rating scales to the MFC format and…
Descriptors: Measurement Techniques, Alternative Assessment, Rating Scales, Questionnaires
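To make the adaptation concrete: one plausible MFC analog of longstring analysis is a pattern-repetition index, the proportion of consecutive blocks answered with an identical rank order. The index below is purely illustrative and not necessarily one the authors propose:

    # Hypothetical pattern-repetition index for MFC triplet blocks: the share
    # of consecutive blocks with identical rank patterns. High values may flag
    # careless responding; the index name and any cutoff are illustrative only.
    import numpy as np

    responses = np.array([
        [[1, 2, 3], [1, 2, 3], [1, 2, 3], [1, 2, 3]],   # suspicious repeater
        [[2, 1, 3], [3, 1, 2], [1, 3, 2], [2, 3, 1]],   # varied responder
    ])

    def pattern_repetition(blocks):
        return np.mean([np.array_equal(blocks[b], blocks[b + 1])
                        for b in range(len(blocks) - 1)])

    for person in responses:
        print(pattern_repetition(person))  # 1.0 vs 0.0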
Peer reviewed
Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024
Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…
Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement
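The RMSD statistic itself is compact: the root of the density-weighted mean squared gap between a group's observed item characteristic curve and the model-implied (international) curve. A sketch assuming a 2PL model, with a shifted curve standing in for the group's empirical proportions:

    # RMSD for one item in one group: root mean squared deviation between the
    # group's observed ICC and the model-implied ICC, weighted by the assumed
    # N(0,1) ability density over quadrature points.
    import numpy as np

    theta = np.linspace(-4, 4, 21)             # quadrature points
    w = np.exp(-0.5 * theta**2); w /= w.sum()  # assumed ability density

    def icc(theta, a=1.2, b=0.3):              # model-implied 2PL curve
        return 1 / (1 + np.exp(-a * (theta - b)))

    observed = icc(theta, b=0.8)               # stand-in for empirical proportions
    print(round(np.sqrt(np.sum(w * (observed - icc(theta))**2)), 3))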
Peer reviewed
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
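The rater effects the abstract describes can be framed as variance components. A crude sketch for a fully crossed teachers-by-raters design using two-way ANOVA mean squares (simulated data; operational protocols are rarely fully crossed, and temporal effects would add another facet):

    # Variance decomposition for a crossed teachers x raters design: how much
    # score variance reflects teachers (signal) versus rater severity (noise).
    import numpy as np

    rng = np.random.default_rng(4)
    n_t, n_r = 30, 6
    scores = (3 + rng.normal(0, 1.0, n_t)[:, None]        # teacher quality
                + rng.normal(0, 0.8, n_r)[None, :]        # rater severity
                + rng.normal(0, 0.5, (n_t, n_r)))         # residual

    ms_t = n_r * scores.mean(axis=1).var(ddof=1)          # teacher mean square
    ms_r = n_t * scores.mean(axis=0).var(ddof=1)          # rater mean square
    res = (scores - scores.mean(1, keepdims=True)
                  - scores.mean(0, keepdims=True) + scores.mean())
    ms_e = (res**2).sum() / ((n_t - 1) * (n_r - 1))       # residual mean square
    var_t = (ms_t - ms_e) / n_r                           # ANOVA estimators
    var_r = (ms_r - ms_e) / n_t
    print(round(var_t, 2), round(var_r, 2), round(ms_e, 2))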