ERIC - Search Results

Showing all 5 results

Measuring the Reliability of Diagnostic Mastery Classifications at Multiple Levels of Reporting

Peer reviewed

Thompson, W. Jake; Clark, Amy K.; Nash, Brooke – Applied Measurement in Education, 2019

As the use of diagnostic assessment systems transitions from research applications to large-scale assessments for accountability purposes, reliability methods that provide evidence at each level of reporting are needed. The purpose of this paper is to summarize one simulation-based method for estimating and reporting reliability for an…

Descriptors: Test Reliability, Diagnostic Tests, Classification, Computation

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide information beyond what the total score provides? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure

Peer reviewed

Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009

Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…

Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring

Bayesian or Non-Bayesian: A Comparison Study of Item Parameter Estimation in the Three-Parameter Logistic Model

Peer reviewed

Gao, Furong; Chen, Lisue – Applied Measurement in Education, 2005

Through a large-scale simulation study, this article compares item parameter estimates obtained by the marginal maximum likelihood estimation (MMLE) and marginal Bayes modal estimation (MBME) procedures in the 3-parameter logistic model. The impact of different prior specifications on the MBME estimates is also investigated using carefully…

Descriptors: Simulation, Computation, Bayesian Statistics, Item Analysis

Investigation of Student Growth Recovery in a Fixed-Item Linking Procedure with a Fixed-Person Prior Distribution for Mixed-Format Test Data

Peer reviewed

Paek, Insu; Young, Michael J. – Applied Measurement in Education, 2005

When the item response theory (IRT) model uses the marginal maximum likelihood estimation, person parameters are usually treated as random parameters following a certain distribution as a prior distribution to estimate the structural parameters in the model. For example, both PARSCALE (Muraki & Bock, 1999) and BILOG 3 (Mislevy & Bock,…

Descriptors: Item Response Theory, Test Items, Maximum Likelihood Statistics, Test Bias
