ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	9

Descriptor

Bayesian Statistics	9
Educational Testing	9
Item Response Theory	5
Scores	4
Statistical Analysis	4
Educational Assessment	3
Educational Research	3
Longitudinal Studies	3
Achievement Gains	2
Achievement Tests	2
Correlation	2
Educational Policy	2
Error of Measurement	2
Evaluation Methods	2
Inferences	2
Models	2
Probability	2
Psychometrics	2
Student Evaluation	2
Accountability	1
Accuracy	1
Adaptive Testing	1
Algorithms	1
Children	1
Cognitive Measurement	1
More ▼

Source

ETS Research Report Series	3
Journal of Educational and…	2
Grantee Submission	1
Journal of Educational…	1
Measurement:…	1
National Center for Education…	1

Publication Type

Journal Articles	7
Reports - Research	6
Opinion Papers	1
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Elementary Secondary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Secondary Education	1

Audience

Location

New York

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	1
Program for International…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Optimizing Large-Scale Educational Assessment with a "Divide-and-Conquer" Strategy: Fast and Efficient Distributed Bayesian Inference in IRT Models

Peer reviewed

Direct link

Sainan Xu; Jing Lu; Jiwei Zhang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational…

Descriptors: Educational Assessment, Bayesian Statistics, Statistical Inference, Item Response Theory

Bayesian Nonparametric Monotone Regression of Dynamic Latent Traits in Item Response Theory Models

Peer reviewed

Direct link

Liu, Yang; Wang, Xiaojing – Journal of Educational and Behavioral Statistics, 2020

Parametric methods, such as autoregressive models or latent growth modeling, are usually inflexible to model the dependence and nonlinear effects among the changes of latent traits whenever the time gap is irregular and the recorded time points are individually varying. Often in practice, the growth trend of latent traits is subject to certain…

Descriptors: Bayesian Statistics, Nonparametric Statistics, Regression (Statistics), Item Response Theory

A Note on Explaining Away and Paradoxical Results in Multidimensional Item Response Theory. Research Report. ETS RR-12-13

Peer reviewed
PDF on ERIC

Download full text

van Rijn, Peter W.; Rijmen, Frank – ETS Research Report Series, 2012

Hooker and colleagues addressed a paradoxical situation that can arise in the application of multidimensional item response theory (MIRT) models to educational test data. We demonstrate that this MIRT paradox is an instance of the explaining-away phenomenon in Bayesian networks, and we attempt to enhance the understanding of MIRT models by placing…

Descriptors: Item Response Theory, Educational Testing, Bayesian Statistics, Statistical Analysis

The Case for Informal Argument

Peer reviewed

Direct link

Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2012

Paul E. Newton's "Clarifying the Consensus Definition of Validity" addresses the single most important, yet stubbornly protean, value in educational and psychological assessment. "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in…

Descriptors: Evidence, Validity, Educational Testing, Psychological Evaluation

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed
PDF on ERIC

Download full text

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Measuring Test Measurement Error: A General Approach

Peer reviewed

Direct link

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Error Rates in Measuring Teacher and School Performance Based on Student Test Score Gains. NCEE 2010-4004

Peer reviewed
PDF on ERIC

Download full text

Schochet, Peter Z.; Chiang, Hanley S. – National Center for Education Evaluation and Regional Assistance, 2010

This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…

Descriptors: Teacher Effectiveness, Teacher Evaluation, Student Evaluation, Scores

Modeling Change in Large-Scale Longitudinal Studies of Educational Growth: Four Decades of Contributions to the Assessment of Educational Growth. Research Report. ETS RR-12-04. ETS R&D Scientific and Policy Contributions Series. ETS SPC-12-01

Peer reviewed
PDF on ERIC

Download full text

Rock, Donald A. – ETS Research Report Series, 2012

This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…

Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development

Modeling Diagnostic Assessments with Bayesian Networks

Peer reviewed

Direct link

Almond, Russell G.; DiBello, Louis V.; Moulder, Brad; Zapata-Rivera, Juan-Diego – Journal of Educational Measurement, 2007

This paper defines Bayesian network models and examines their applications to IRT-based cognitive diagnostic modeling. These models are especially suited to building inference engines designed to be synchronous with the finer grained student models that arise in skills diagnostic assessment. Aspects of the theory and use of Bayesian network models…

Descriptors: Inferences, Models, Item Response Theory, Cognitive Measurement

Almond, Russell G.	1
Boyd, Donald	1
Chiang, Hanley S.	1
Chun Wang	1
DiBello, Louis V.	1
Gongjun Xu	1
Jing Lu	1
Jiwei Zhang	1
Lankford, Hamilton	1
Liu, Yang	1
Loeb, Susanna	1
Mislevy, Robert J.	1
Moulder, Brad	1
Rijmen, Frank	1
Rock, Donald A.	1
Sainan Xu	1
Schochet, Peter Z.	1
Wang, Xiaojing	1
Wyckoff, James	1
Zapata-Rivera, Juan-Diego	1
Zwick, Rebecca	1
van Rijn, Peter W.	1
More ▼