ERIC - Search Results

Showing all 11 results

The NAEP EDM Competition: On the Value of Theory-Driven Psychometrics and Machine Learning for Predictions Based on Log Data

Peer reviewed

Zehner, Fabian; Harrison, Scott; Eichmann, Beate; Deribo, Tobias; Bengs, Daniel; Andersen, Nico; Hahnel, Carolin – International Educational Data Mining Society, 2020

The "2nd Annual WPI-UMASS-UPENN EDM Data Mining Challenge" required contestants to predict efficient test-taking based on log data. In this paper, we describe our theory-driven and psychometric modeling approach. For feature engineering, we employed the Log-Normal Response Time Model for estimating latent person speed, and the Generalized…

Descriptors: Data Analysis, Competition, Classification, Prediction

Research on Psychometric Modeling, Analysis, and Reporting of the National Assessment of Educational Progress

Peer reviewed

Oranje, Andreas; Kolstad, Andrew – Journal of Educational and Behavioral Statistics, 2019

The design and psychometric methodology of the National Assessment of Educational Progress (NAEP) is constantly evolving to meet the changing interests and demands stemming from a rapidly shifting educational landscape. NAEP has been built on strong research foundations that include conducting extensive evaluations and comparisons before new…

Descriptors: National Competency Tests, Psychometrics, Statistical Analysis, Computation

The Use of Test Scores from Large-Scale Assessment Surveys: Psychometric and Statistical Considerations

Peer reviewed

Braun, Henry; von Davier, Matthias – Large-scale Assessments in Education, 2017

Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT…

Descriptors: Scores, Test Use, Measurement, Psychometrics

The Performance of IRT Model Selection Methods with Mixed-Format Tests

Peer reviewed

Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012

When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…

Descriptors: Item Response Theory, Models, Selection, Criteria

Stochastic Approximation Methods for Latent Regression Item Response Models

Peer reviewed

von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2010

This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…

Descriptors: Item Response Theory, Statistical Analysis, Regression (Statistics), Models

Fitting the Structured General Diagnostic Model to NAEP Data. Research Report. ETS RR-08-27

Peer reviewed

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

Xu and von Davier (2006) demonstrated the feasibility of using the general diagnostic model (GDM) to analyze National Assessment of Educational Progress (NAEP) proficiency data. Their work showed that the GDM analysis not only led to conclusions for gender and race groups similar to those published in the NAEP Report Card, but also allowed…

Descriptors: National Competency Tests, Models, Data Analysis, Reading Tests

The Effect of Position and Format on the Difficulty of Assessment Exercises.

Burton, Nancy W.; And Others – 1976

Assessment exercises (items) in three different formats--multiple-choice with an "I don't know" (IDK) option, multiple-choice without the IDK, and open-ended--were placed at the beginning, middle and end of 45-minute assessment packages (instruments). A balanced incomplete blocks analysis of variance was computed to determine the biasing…

Descriptors: Age Differences, Difficulty Level, Educational Assessment, Guessing (Tests)

Use of Person-Fit Statistics in Reporting and Analyzing National Assessment of Educational Progress Results. Research and Development Report.

Rudner, Lawrence M.; And Others – 1995

Fit statistics provide a direct measure of assessment accuracy by analyzing the fit of measurement models to an individual's (or group's) response pattern. Students that lose interest during the assessment, for example, will miss exercises that are within their abilities. Such students will respond correctly to some more difficult items and…

Descriptors: Difficulty Level, Educational Assessment, Goodness of Fit, Measurement Techniques

IRT as a Way of Improving the Usefulness of Complex Data.

Beaton, Albert E.; Johnson, Eugene G. – 1990

When the Educational Testing Service became the administrator of the National Assessment of Educational Progress (NAEP) in 1983, it introduced scales based on item response theory (IRT) as a way of presenting results of the assessment to the general public. Some properties of the scales and their uses are discussed. Initial attempts at presenting…

Descriptors: Academic Achievement, Data Interpretation, Educational Assessment, Elementary Secondary Education

A Sampling of Statistical Problems Encountered at the Educational Testing Service. Program Statistics Research Technical Report No. 92-26.

Wainer, Howard; And Others – 1992

Four researchers at the Educational Testing Service describe what they consider some of the most vexing research problems they face. While these problems are not completely statistical, they all have major statistical components. Following the introduction (section 1), in section 2, "Problems with the Simultaneous Estimation of Many True…

Descriptors: Adaptive Testing, Bayesian Statistics, Educational Research, Estimation (Mathematics)

Cognitive Diagnosis for NAEP Proficiency Data. Research Report. ETS RR-06-08

Peer reviewed

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006

More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…

Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory
