Showing 1 to 15 of 28 results
Peer reviewed
PDF on ERIC (full text available)
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR), and Bayesian estimation methods were compared on mixed item-response-type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Peer reviewed
Direct link
Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019
The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3-parameter logistic (3PL) model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…
Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level
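(For reference: in the standard 3PL model named above, the probability of a correct response is conventionally written as P(X = 1 | θ) = c + (1 − c) / (1 + exp(−a(θ − b))), where a is the discrimination, b the difficulty, c the pseudo-chance (lower-asymptote) parameter, and θ the person ability. This is the textbook formulation, not one quoted from the article, which may use a different parameterization.)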
Peer reviewed
Direct link
Savalei, Victoria; Rhemtulla, Mijke – Journal of Educational and Behavioral Statistics, 2017
In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately…
Descriptors: Computation, Statistical Analysis, Test Items, Maximum Likelihood Statistics
Peer reviewed
Direct link
Zeller, Florian; Krampen, Dorothea; Reiß, Siegbert; Schweizer, Karl – Educational and Psychological Measurement, 2017
The item-position effect describes how an item's position within a test, that is, the number of previously completed items, affects the response to that item. Previously, this effect was represented by constraints reflecting simple trajectories, for example, a linear increase. Due to the inflexibility of these representations, our aim was to examine…
Descriptors: Goodness of Fit, Simulation, Factor Analysis, Intelligence Tests
Peer reviewed
Direct link
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a posteriori (EAP), modal a posteriori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
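(For reference: these estimators differ in how they convert a response pattern x into an ability estimate. As standard definitions, not quotations from the article: ML maximizes the likelihood L(x | θ); EAP is the posterior mean, θ̂_EAP = ∫ θ L(x | θ) g(θ) dθ / ∫ L(x | θ) g(θ) dθ for a prior g(θ); MAP is the posterior mode; and WML maximizes the likelihood weighted by a bias-correction term.)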
Peer reviewed
Direct link
Nicklin, Christopher; Vitta, Joseph P. – Language Testing, 2022
Instrument measurement conducted with Rasch analysis is a common process in language assessment research. A recent systematic review of 215 studies involving Rasch analysis in language testing and applied linguistics research reported that 23 different software packages had been utilized. However, none of the analyses were conducted with one of…
Descriptors: Programming Languages, Vocabulary Development, Language Tests, Computer Software
Peer reviewed
Direct link
Ramsay, James O.; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2017
This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
Descriptors: Scoring, Test Theory, Computation, Maximum Likelihood Statistics
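(For reference: the root mean squared error cited here is the standard accuracy measure RMSE(θ̂) = sqrt(E[(θ̂ − θ)²]), so the reported improvement of about 5% is a roughly 5% reduction in this quantity. The definition is standard usage, not taken from the article.)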
Peer reviewed
Direct link
Moothedath, Shana; Chaporkar, Prasanna; Belur, Madhu N. – Perspectives in Education, 2016
In recent years, the computerised adaptive test (CAT) has gained popularity over conventional exams in evaluating student capabilities with desired accuracy. However, the key limitation of CAT is that it requires a large pool of pre-calibrated questions. In the absence of such a pre-calibrated question bank, offline exams with uncalibrated…
Descriptors: Guessing (Tests), Computer Assisted Testing, Adaptive Testing, Maximum Likelihood Statistics
Lamsal, Sunil – ProQuest LLC, 2015
Different estimation procedures have been developed for the unidimensional three-parameter item response theory (IRT) model. These techniques include marginal maximum likelihood estimation, fully Bayesian estimation using Markov chain Monte Carlo simulation techniques, and Metropolis-Hastings Robbins-Monro estimation. With each…
Descriptors: Item Response Theory, Monte Carlo Methods, Maximum Likelihood Statistics, Markov Processes
Peer reviewed
Direct link
Koziol, Natalie A. – Applied Measurement in Education, 2016
Testlets, or groups of related items, are commonly included in educational assessments due to their many logistical and conceptual advantages. Despite their advantages, testlets introduce complications into the theory and practice of educational measurement. Responses to items within a testlet tend to be correlated even after controlling for…
Descriptors: Classification, Accuracy, Comparative Analysis, Models
Custer, Michael; Sharairi, Sid; Swift, David – Online Submission, 2012
This paper utilized the Rasch model and Joint Maximum Likelihood Estimation to study different scoring options for omitted and not-reached items. Three scoring treatments were studied. The first method treated omitted and not-reached items as "ignorable/blank". The second treatment scored omits as incorrect ("0") and left not-reached items as blank…
Descriptors: Scoring, Test Items, Item Response Theory, Maximum Likelihood Statistics
Peer reviewed
Direct link
Jensen, Nate; Rice, Andrew; Soland, James – Educational Evaluation and Policy Analysis, 2018
While most educators assume that not all students try their best on achievement tests, no current research examines if behaviors associated with low test effort, like rapidly guessing on test items, affect teacher value-added estimates. In this article, we examined the prevalence of rapid guessing to determine if this behavior varied by grade,…
Descriptors: Item Response Theory, Value Added Models, Achievement Tests, Test Items
Peer reviewed
Direct link
Ho, Tsung-Han; Dodd, Barbara G. – Applied Measurement in Education, 2012
In this study we compared five item selection procedures using three ability estimation methods in the context of a mixed-format adaptive test based on the generalized partial credit model. The item selection procedures used were maximum posterior weighted information, maximum expected information, maximum posterior weighted Kullback-Leibler…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Selection
Peer reviewed
Direct link
Chen, Ping; Xin, Tao; Wang, Chun; Chang, Hua-Hua – Psychometrika, 2012
Item replenishing is essential for item bank maintenance in cognitive diagnostic computerized adaptive testing (CD-CAT). In regular CAT, online calibration is commonly used to calibrate new items continuously. However, no publicly available reference has yet addressed online calibration for CD-CAT. Thus, this study investigates the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Diagnostic Tests, Cognitive Tests
Peer reviewed
PDF on ERIC (full text available)
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation