ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	9

Descriptor

Computation	10
Statistical Distributions	10
Test Items	10
Difficulty Level	5
Item Response Theory	5
Statistical Analysis	5
Bayesian Statistics	4
Maximum Likelihood Statistics	4
Equated Scores	3
Models	3
Monte Carlo Methods	3
Sample Size	3
Ability	2
Achievement Tests	2
Adaptive Testing	2
Correlation	2
Goodness of Fit	2
Statistical Bias	2
Causal Models	1
Change	1
College Students	1
Comparative Analysis	1
Computer Assisted Testing	1
Equations (Mathematics)	1
Error of Measurement	1

Source

Educational and Psychological…	5
Journal of Educational…	2
ETS Research Report Series	1
ProQuest LLC	1
Psychometrika	1

Publication Type

Journal Articles	9
Reports - Research	8
Dissertations/Theses -…	1
Reports - Evaluative	1

Education Level

Elementary Education	1
Grade 7	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Location

South Korea

Showing all 10 results

Pretest Item Calibration in Computerized Multistage Adaptive Testing

Peer reviewed

Ersen, Rabia Karatoprak; Lee, Won-Chan – Journal of Educational Measurement, 2023

The purpose of this study was to compare calibration and linking methods for placing pretest item parameter estimates on the item pool scale in a 1-3 computerized multistage adaptive testing design in terms of item parameter recovery. Two models were used: embedded-section, in which pretest items were administered within a separate module, and…

Descriptors: Pretesting, Test Items, Computer Assisted Testing, Adaptive Testing

The Role of Item Distributions on Reliability Estimation: The Case of Cronbach's Coefficient Alpha

Peer reviewed

Olvera Astivia, Oscar Lorenzo; Kroc, Edward; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020

Simulations concerning the distributional assumptions of coefficient alpha are contradictory. To provide a more principled theoretical framework, this article relies on the Fréchet-Hoeffding bounds in order to showcase that the distribution of the items plays a role in the estimation of correlations and covariances. More specifically, these bounds…

Descriptors: Test Items, Test Reliability, Computation, Correlation

Enhancing the Equating of Item Difficulty Metrics: Estimation of Reference Distribution. Research Report. ETS RR-14-07

Peer reviewed
PDF on ERIC

Ali, Usama S.; Walker, Michael E. – ETS Research Report Series, 2014

Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, or the item response curve (IRC) method, involves the summation of conditional…

Descriptors: Difficulty Level, Test Items, Equated Scores, Causal Models

Situations Where It Is Appropriate to Use Frequency Estimation Equipercentile Equating

Peer reviewed

Guo, Hongwen; Oh, Hyeonjoo J.; Eignor, Daniel – Journal of Educational Measurement, 2013

In operational equating situations, frequency estimation equipercentile equating is considered only when the old and new groups have similar abilities. The frequency estimation assumptions are investigated in this study under various situations from both the levels of theoretical interest and practical use. It shows that frequency estimation…

Descriptors: Equated Scores, Computation, Statistical Analysis, Test Items

Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

Peer reviewed

Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…

Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics

Coefficient Omega Bootstrap Confidence Intervals: Nonnormal Distributions

Peer reviewed

Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2013

The performance of the normal theory bootstrap (NTB), the percentile bootstrap (PB), and the bias-corrected and accelerated (BCa) bootstrap confidence intervals (CIs) for coefficient omega was assessed through a Monte Carlo simulation under conditions not previously investigated. Of particular interest were nonnormal Likert-type and binary items.…

Descriptors: Sampling, Statistical Inference, Computation, Statistical Analysis

The Performance of the Linear Logistic Test Model When the Q-Matrix Is Misspecified: A Simulation Study

MacDonald, George T. – ProQuest LLC, 2014

A simulation study was conducted to explore the performance of the linear logistic test model (LLTM) when the relationships between items and cognitive components were misspecified. Factors manipulated included percent of misspecification (0%, 1%, 5%, 10%, and 15%), form of misspecification (under-specification, balanced misspecification, and…

Descriptors: Simulation, Item Response Theory, Models, Test Items

A Comparison of Three IRT Approaches to Examinee Ability Change Modeling in a Single-Group Anchor Test Design

Peer reviewed

Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014

Typically, longitudinal growth modeling based on item response theory (IRT) requires repeated measures data from a single group with the same test design. If operational or item exposure problems are present, the same test may not be employed to collect data for longitudinal analyses, and tests at multiple time points are constructed with unique…

Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores

l[subscript z] Person-Fit Index to Identify Misfit Students with Achievement Test Data

Peer reviewed

Seo, Dong Gi; Weiss, David J. – Educational and Psychological Measurement, 2013

The usefulness of the l[subscript z] person-fit index was investigated with achievement test data from 20 exams given to more than 3,200 college students. Results for three methods of estimating θ showed that the distributions of l[subscript z] were not consistent with its theoretical distribution, resulting in general overfit to the item response…

Descriptors: Achievement Tests, College Students, Goodness of Fit, Item Response Theory

Empirical Bayes Estimates of Domain Scores under Binomial and Hypergeometric Distributions for Test Scores

Peer reviewed

Lin, Miao-Hsiang; Hsiung, Chao A. – Psychometrika, 1994

Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under binomial and hypergeometric distributions respectively. Criteria are established regarding use of these functions over maximum likelihood estimation counterparts. (SLD)

Descriptors: Adaptive Testing, Bayesian Statistics, Computation, Equations (Mathematics)
