Showing all 11 results
Peer reviewed
Paek, Insu; Park, Hyun-Jeong; Cai, Li; Chi, Eunlim – Educational and Psychological Measurement, 2014
Typically, longitudinal growth modeling based on item response theory (IRT) requires repeated-measures data from a single group with the same test design. When operational or item-exposure problems are present, the same test may not be employed to collect data for longitudinal analyses, and tests at multiple time points are constructed with unique…
Descriptors: Item Response Theory, Comparative Analysis, Test Items, Equated Scores
Kang, Taehoon; Petersen, Nancy S. – ACT, Inc., 2009
This paper compares three methods of item calibration--concurrent calibration, separate calibration with linking, and fixed item parameter calibration--that are frequently used for linking item parameters to a base scale. Concurrent and separate calibrations were implemented using BILOG-MG. The Stocking and Lord (1983) characteristic curve method…
Descriptors: Standards, Testing Programs, Test Items, Statistical Distributions
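The Stocking-Lord characteristic curve method mentioned above chooses a linear transformation of the new scale (theta_base = A * theta_new + B) that makes the anchor items' test characteristic curves agree across calibrations. A minimal sketch, simplified to the 2PL case with made-up anchor-item parameters and a grid search in place of the numerical optimizer a real linking program would use:

```python
import numpy as np

def tcc(theta, a, b):
    """Test characteristic curve: expected number-correct score at theta (2PL)."""
    return sum(1 / (1 + np.exp(-1.7 * ai * (theta - bi))) for ai, bi in zip(a, b))

# Same three anchor items calibrated on two scales (hypothetical values).
a_base, b_base = [1.0, 1.4, 0.8], [-0.5, 0.0, 0.7]
a_new,  b_new  = [0.9, 1.3, 0.7], [-0.3, 0.2, 0.9]

theta = np.linspace(-3, 3, 61)
best = None
for A in np.arange(0.5, 1.51, 0.01):        # grid over the slope A
    for B in np.arange(-1.0, 1.01, 0.01):   # and the intercept B
        # Under theta_base = A*theta_new + B: a* = a/A, b* = A*b + B
        a_t = [ai / A for ai in a_new]
        b_t = [A * bi + B for bi in b_new]
        loss = np.mean((tcc(theta, a_base, b_base) - tcc(theta, a_t, b_t)) ** 2)
        if best is None or loss < best[0]:
            best = (loss, A, B)

loss, A, B = best
print(A, B)
```

The same idea extends to the 3PL models these programs actually fit; only the item response function inside `tcc` changes.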
Hwang, Dae-Yeop – 2002
This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislevy and D. Bock, 1997). The example was a 15-item test with a sample size of 600…
Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions
Kirisci, Levent; Hsu, Tse-Chi – 1995
The main goal of this study was to assess how sensitive unidimensional parameter estimates derived from BILOG were when the unidimensionality assumption was violated and the underlying ability distribution was not multivariate normal. A multidimensional three-parameter logistic distribution that was a straightforward generalization of the…
Descriptors: Ability, Comparative Analysis, Correlation, Difficulty Level
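The multidimensional generalization referred to above replaces the single ability with an ability vector. A minimal sketch of the compensatory multidimensional 3PL item response function, with hypothetical parameter values:

```python
import math

def m3pl_prob(theta, a, d, c):
    """P(correct | theta) = c + (1 - c) * logistic(1.7 * (a . theta + d))."""
    z = 1.7 * (sum(ai * ti for ai, ti in zip(a, theta)) + d)
    return c + (1 - c) / (1 + math.exp(-z))

# Two-dimensional ability vector; c is the pseudo-guessing lower asymptote.
p = m3pl_prob(theta=[0.5, -0.2], a=[1.2, 0.8], d=0.3, c=0.2)
print(round(p, 3))  # → 0.823
```

Fitting this model to data generated under violated unidimensionality is how studies like the one above probe the robustness of unidimensional (e.g., BILOG) estimates.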
Pommerich, Mary; And Others – 1995
The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Item Bias
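The MH procedure conditions on observed score by pooling 2x2 (group by item-response) tables across score strata into a common odds ratio. A minimal sketch with hypothetical counts for one item; the -2.35 rescaling to the ETS delta metric is the conventional MH D-DIF report:

```python
import math

# Each stratum: (ref_correct, ref_incorrect, focal_correct, focal_incorrect),
# one tuple per observed-score level (hypothetical counts).
strata = [
    (40, 10, 30, 20),
    (60, 15, 45, 25),
    (80, 10, 70, 15),
]

def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio: sum(A_k*D_k/N_k) / sum(B_k*C_k/N_k)."""
    num = den = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        num += a * d / n   # reference-correct * focal-incorrect
        den += b * c / n   # reference-incorrect * focal-correct
    return num / den

alpha = mantel_haenszel_odds_ratio(strata)
delta = -2.35 * math.log(alpha)   # MH D-DIF on the ETS delta scale
print(round(alpha, 3), round(delta, 3))
```

An odds ratio above 1 (negative delta) indicates the item favors the reference group at matched observed scores, which is exactly where incongruent ability distributions can mislead the matching.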
Narayanan, Pankaja; Swaminathan, H. – 1993
The purpose of this study was to compare two non-parametric procedures, the Mantel-Haenszel (MH) procedure and the simultaneous item bias (SIB) procedure, with respect to their Type I error rates and power, and to investigate the conditions under which asymptotic distributional properties of the SIB and MH were obtained. Data were simulated to…
Descriptors: Ability, Comparative Analysis, Computer Simulation, Control Groups
Peer reviewed
Huynh, Huynh; Ferrara, Steven – Journal of Educational Measurement, 1994
Equal percentile (EP) and partial credit (PC) equatings for raw scores from performance-based assessments with free-response items are compared through the use of data from the Maryland School Performance Assessment Program. Results suggest that EP and PC methods do not give equivalent results when distributions are markedly skewed. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Mathematics Tests, Performance Based Assessment
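Equipercentile equating as compared above maps a form-X raw score to the form-Y raw score holding the same percentile rank. A minimal sketch using simulated (hypothetical) score samples in place of the Maryland assessment data; a production equating would also smooth the distributions:

```python
import numpy as np

rng = np.random.default_rng(0)
x_scores = rng.binomial(30, 0.6, size=500)   # hypothetical form-X raw scores
y_scores = rng.binomial(30, 0.7, size=500)   # hypothetical form-Y raw scores

def equipercentile_equate(x, x_dist, y_dist):
    """Return the form-Y score with the same percentile rank as x on form X."""
    p = np.mean(np.asarray(x_dist) <= x)             # percentile rank of x
    return float(np.quantile(np.asarray(y_dist), p)) # matching form-Y quantile

eq = equipercentile_equate(18, x_scores, y_scores)
print(eq)
```

Because the mapping depends on the full shape of both distributions, marked skewness moves the equated scores in ways a model-based (e.g., partial credit) equating need not reproduce, which is the discrepancy the study reports.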
Mills, Craig N.; Melican, Gerald J. – 1987
The study compares three methods for establishing cut-off scores that effect a compromise between absolute cut-offs based on item difficulty and relative cut-offs based on expected passing rates. Each method coordinates these two types of information differently. The Beuk method obtains judges' estimates of an absolute cut-off and an expected…
Descriptors: Academic Standards, Certification, Comparative Analysis, Cutting Scores
Peer reviewed
Camilli, Gregory – Applied Psychological Measurement, 1992
A mathematical model is proposed to describe how group differences in distributions of abilities, which are distinct from the target ability, influence the probability of a correct item response. In the multidimensional approach, differential item functioning is considered a function of the educational histories of the examinees. (SLD)
Descriptors: Ability, Comparative Analysis, Equations (Mathematics), Factor Analysis
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
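The abstract does not say which four indices were compared, but the classic upper-lower discrimination index is representative of the family: the difference in proportion correct on an item between the top and bottom fractions (commonly 27%) of scorers. A minimal sketch with hypothetical data:

```python
def discrimination_index(item_correct, total_scores, frac=0.27):
    """Upper-lower index D = p(correct | top frac) - p(correct | bottom frac)."""
    pairs = sorted(zip(total_scores, item_correct))  # order examinees by total score
    k = max(1, round(frac * len(pairs)))
    lower = pairs[:k]    # bottom scorers
    upper = pairs[-k:]   # top scorers
    return sum(c for _, c in upper) / k - sum(c for _, c in lower) / k

# 10 examinees: total test score and 0/1 response to the item of interest
totals  = [4, 6, 7, 9, 10, 12, 13, 15, 17, 18]
correct = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
D = discrimination_index(correct, totals)
print(D)  # → 1.0
```

Indices of this kind are sensitive to the shape of the score distribution (the top/bottom groups are percentile-defined), which is why comparing them under normal, bimodal, and skewed distributions is informative.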
Garrido, Mariquita; Payne, David A. – 1987
Minimum competency cut-off scores on a statistics exam were estimated under four conditions: the Angoff judging method with item data (n=20), and without data available (n=19); and the Modified Angoff method with (n=19), and without (n=19) item data available to judges. The Angoff method required free-response percentage estimates (0-100 percent),…
Descriptors: Academic Standards, Comparative Analysis, Criterion Referenced Tests, Cutting Scores
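In the Angoff procedure above, each judge estimates the probability that a minimally competent examinee answers each item correctly; the item estimates are summed within judge and averaged across judges to give the cut-off score. A minimal sketch with hypothetical ratings for a five-item test:

```python
# Hypothetical Angoff ratings: per judge, one probability per item that a
# minimally competent examinee answers that item correctly.
ratings = {
    "judge1": [0.60, 0.75, 0.40, 0.85, 0.55],
    "judge2": [0.65, 0.70, 0.50, 0.80, 0.60],
    "judge3": [0.55, 0.80, 0.45, 0.90, 0.50],
}

def angoff_cut_score(ratings):
    """Sum each judge's item probabilities, then average across judges."""
    judge_sums = [sum(r) for r in ratings.values()]
    return sum(judge_sums) / len(judge_sums)

cut = angoff_cut_score(ratings)
print(cut)   # cut-off in raw-score (number-correct) units
```

Providing judges with item data, as in the study's "with data" conditions, is intended to anchor these probability estimates in observed item difficulty.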