ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	9

Source

Journal of Educational and…

Publication Type

Journal Articles	9
Reports - Research	4
Reports - Evaluative	3
Reports - Descriptive	2

Education Level

Elementary Education	3
Grade 4	3
Intermediate Grades	2
Elementary Secondary Education	1
Grade 8	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Sweden	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	9
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

IRT and MIRT Models for Item Parameter Estimation with Multidimensional Multistage Tests

Peer reviewed

Direct link

Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020

In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…

Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed
PDF on ERIC

Download full text

Direct link

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

Using Heteroskedastic Ordered Probit Models to Recover Moments of Continuous Test Score Distributions from Coarsened Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2017

Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…

Descriptors: Scores, Statistical Analysis, Models, Computation

A Strategy for Replacing Sum Scoring

Peer reviewed

Direct link

Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…

Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics

Determining Sample Sizes for Precise Contrast Analysis with Heterogeneous Variances

Peer reviewed

Direct link

Jan, Show-Li; Shieh, Gwowen – Journal of Educational and Behavioral Statistics, 2014

The analysis of variance (ANOVA) is one of the most frequently used statistical analyses in practical applications. Accordingly, the single and multiple comparison procedures are frequently applied to assess the differences among mean effects. However, the underlying assumption of homogeneous variances may not always be tenable. This study…

Descriptors: Sample Size, Statistical Analysis, Computation, Probability

Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

Peer reviewed

Direct link

Ho, Andrew D.; Reardon, Sean F. – Journal of Educational and Behavioral Statistics, 2012

Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This article introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

Descriptors: Achievement Gap, Scores, Computation, Classification

The Impact of Variability of Item Parameter Estimators on Test Information Function

Peer reviewed

Direct link

Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2012

The impact of uncertainty about item parameters on test information functions is investigated. The information function of a test is one of the most important tools in item response theory (IRT). Inaccuracy in the estimation of test information can have substantial consequences on data analyses based on IRT. In this article, the major part (called…

Descriptors: Item Response Theory, Tests, Accuracy, Data Analysis

Performance of Random Effects Model Estimators under Complex Sampling Designs

Peer reviewed

Direct link

Jia, Yue; Stokes, Lynne; Harris, Ian; Wang, Yan – Journal of Educational and Behavioral Statistics, 2011

In this article, we consider estimation of parameters of random effects models from samples collected via complex multistage designs. Incorporation of sampling weights is one way to reduce estimation bias due to unequal probabilities of selection. Several weighting methods have been proposed in the literature for estimating the parameters of…

Descriptors: Sampling, Computation, Statistical Bias, Statistical Analysis

On the Estimation of Hierarchical Latent Regression Models for Large-Scale Assessments

Peer reviewed

Direct link

Li, Deping; Oranje, Andreas; Jiang, Yanlin – Journal of Educational and Behavioral Statistics, 2009

To find population proficiency distributions, a two-level hierarchical linear model may be applied to large-scale survey assessments such as the National Assessment of Educational Progress (NAEP). The model and parameter estimation are developed and a simulation was carried out to evaluate parameter recovery. Subsequently, both a hierarchical and…

Descriptors: Computation, National Competency Tests, Measurement, Regression (Statistics)

Computation	9
National Competency Tests	8
Item Response Theory	4
Models	4
Statistical Analysis	4
Error of Measurement	3
Grade 4	3
Monte Carlo Methods	3
Reading Tests	3
Sample Size	3
Scores	3
Test Items	3
College Entrance Examinations	2
Comparative Analysis	2
Measurement	2
Probability	2
Accuracy	1
Achievement Gap	1
Adaptive Testing	1
Bias	1
Classification	1
Computer Assisted Testing	1
Correlation	1
Data Analysis	1
Educational Assessment	1
More ▼

Ho, Andrew D.	2
Oranje, Andreas	2
Reardon, Sean F.	2
Castellano, Katherine E.	1
Harris, Ian	1
Jan, Show-Li	1
Jewsbury, Paul A.	1
Jia, Yue	1
Jiang, Yanlin	1
Kolstad, Andrew	1
Li, Deping	1
Ramsay, James O.	1
Shear, Benjamin R.	1
Shieh, Gwowen	1
Stokes, Lynne	1
Wang, Yan	1
Wiberg, Marie	1
Zhang, Jinming	1
van Rijn, Peter W.	1
More ▼