ERIC - Search Results

Showing all 9 results

Five Methods for Estimating Angoff Cut Scores with IRT

Peer reviewed

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017

This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a posteriori (EAP), modal a posteriori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…

Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics

A Mixture Rasch Model with a Covariate: A Simulation Study via Bayesian Markov Chain Monte Carlo Estimation

Peer reviewed

Dai, Yunyun – Applied Psychological Measurement, 2013

Mixtures of item response theory (IRT) models have been proposed as a technique to explore response patterns in test data related to cognitive strategies, instructional sensitivity, and differential item functioning (DIF). Estimation proves challenging due to difficulties in identification and questions of effect size needed to recover underlying…

Descriptors: Item Response Theory, Test Bias, Computation, Bayesian Statistics

Evaluation of Two Types of Differential Item Functioning in Factor Mixture Models with Binary Outcomes

Peer reviewed

Lee, HwaYoung; Beretvas, S. Natasha – Educational and Psychological Measurement, 2014

Conventional differential item functioning (DIF) detection methods (e.g., the Mantel-Haenszel test) can be used to detect DIF only across observed groups, such as gender or ethnicity. However, research has found that DIF is not typically fully explained by an observed variable. True sources of DIF may include unobserved, latent variables, such as…

Descriptors: Item Analysis, Factor Structure, Bayesian Statistics, Goodness of Fit

A Review of ETS Differential Item Functioning Assessment Procedures: Flagging Rules, Minimum Sample Size Requirements, and Criterion Refinement. Research Report. ETS RR-12-08

Peer reviewed

Zwick, Rebecca – ETS Research Report Series, 2012

Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. The goal of this project was to review the status of ETS DIF analysis procedures, focusing on three aspects: (a) the nature and stringency of the statistical rules used to flag items, (b) the minimum sample size…

Descriptors: Test Bias, Sample Size, Bayesian Statistics, Evaluation Methods

Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

Peer reviewed

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2013

Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Using Past Data to Enhance Small Sample DIF Estimation: A Bayesian Approach

Peer reviewed

Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O. – Journal of Educational and Behavioral Statistics, 2009

Test administrators often face the challenge of detecting differential item functioning (DIF) with samples of size smaller than that recommended by experts. A Bayesian approach can incorporate, in the form of a prior distribution, existing information on the inference problem at hand, which yields more stable estimation, especially for small…

Descriptors: Test Bias, Computation, Bayesian Statistics, Data

Some Properties of a Bayesian Adaptive Ability Testing Strategy.

McBride, James R.; Weiss, David J. – 1976

Four Monte Carlo simulation studies of Owen's Bayesian sequential procedure for adaptive mental testing were conducted. Whereas previous simulation studies of this procedure have concentrated on evaluating it in terms of the correlation of its test scores with simulated ability in a normal population, these four studies explored a number of…

Descriptors: Adaptive Testing, Bayesian Statistics, Branching, Computer Oriented Programs

Bias and Information of Bayesian Adaptive Testing. Research Report 83-2.

Weiss, David J.; McBride, James R. – 1983

Monte Carlo simulation was used to investigate score bias and information characteristics of Owen's Bayesian adaptive testing strategy, and to examine possible causes of score bias. Factors investigated in three related studies included effects of item discrimination, effects of fixed vs. variable test length, and effects of an accurate prior…

Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

A Comparison of the Fairness of Adaptive and Conventional Testing Strategies. Research Report 78-1.

Pine, Steven M.; Weiss, David J. – 1978

This report examines how selection fairness is influenced by the characteristics of a selection instrument in terms of its distribution of item difficulties, level of item discrimination, degree of item bias, and testing strategy. Computer simulation was used in the administration of either a conventional or Bayesian adaptive ability test to a…

Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Testing, Computer Assisted Testing