Search filters
Publication Date
    In 2025: 0
    Since 2024: 0
    Since 2021 (last 5 years): 1
    Since 2016 (last 10 years): 6
    Since 2006 (last 20 years): 7
Source
    Journal of Educational Measurement: 17
Publication Type
    Journal Articles: 15
    Reports - Research: 12
    Reports - Evaluative: 3
Education Level
    Elementary Education: 1
Assessments and Surveys
    SAT (College Admission Test): 2
    Graduate Record Examinations: 1
    Stanford Achievement Tests: 1
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
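For reference, the classical item analysis (CIA) indices that the SDT-based measures are compared against are the proportion correct and the corrected item-total (point-biserial) correlation. A minimal sketch of those classical indices, not of DeCarlo's SDT measures:

    # Illustrative classical item analysis (the comparison point in the abstract),
    # not the SDT-based measures: difficulty as proportion correct, discrimination
    # as the corrected item-total (point-biserial) correlation.
    import numpy as np

    def classical_item_stats(responses):
        """responses: examinees x items matrix of 0/1 scores."""
        responses = np.asarray(responses, dtype=float)
        difficulty = responses.mean(axis=0)               # p-value per item
        total = responses.sum(axis=1)
        discrimination = []
        for j in range(responses.shape[1]):
            rest = total - responses[:, j]                # exclude the item itself
            r = np.corrcoef(responses[:, j], rest)[0, 1]  # corrected point-biserial
            discrimination.append(r)
        return difficulty, np.array(discrimination)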
Berger, Stéphanie; Verschoor, Angela J.; Eggen, Theo J. H. M.; Moser, Urs – Journal of Educational Measurement, 2019
Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that…
Descriptors: Simulation, Computer Assisted Testing, Test Items, Difficulty Level
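The rationale for matching item difficulty to student ability follows from the Rasch item information function, which peaks when ability equals difficulty. A minimal sketch of the standard Rasch formulas (not the authors' targeted multistage calibration designs):

    # Rasch model: P(correct) = exp(theta - b) / (1 + exp(theta - b)).
    # Item information is p * (1 - p), maximized when theta == b, which is why
    # well-targeted items yield more efficient calibration.
    import math

    def rasch_prob(theta, b):
        return 1.0 / (1.0 + math.exp(-(theta - b)))

    def rasch_item_information(theta, b):
        p = rasch_prob(theta, b)
        return p * (1.0 - p)

    print(rasch_item_information(0.0, 0.0))   # 0.25 (maximum)
    print(rasch_item_information(0.0, 3.0))   # ~0.045 (poorly targeted item)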
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
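To make the error structure concrete, here is a simplified sketch of how an Angoff cut score is typically aggregated and how judge-to-judge variability propagates into its standard error (a judges-by-items illustration, not the study's full generalizability analysis):

    # Simplified Angoff aggregation: each judge rates the probability that a
    # minimally competent examinee answers each item correctly; the cut score is
    # the mean over judges and items. Variation in judges' mean ratings
    # contributes error; item variance plays the more complicated role discussed
    # in the abstract.
    import numpy as np

    def angoff_cut_score(ratings):
        """ratings: judges x items matrix of probability judgments (0-1)."""
        ratings = np.asarray(ratings, dtype=float)
        n_judges, n_items = ratings.shape
        cut = ratings.mean() * n_items              # expected raw cut score
        judge_means = ratings.mean(axis=1) * n_items
        se_judges = judge_means.std(ddof=1) / np.sqrt(n_judges)
        return cut, se_judges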
Albano, Anthony D.; Cai, Liuhan; Lease, Erin M.; McConnell, Scott R. – Journal of Educational Measurement, 2019
Studies have shown that item difficulty can vary significantly based on the context of an item within a test form. In particular, item position may be associated with practice and fatigue effects that influence item parameter estimation. The purpose of this research was to examine the relevance of item position specifically for assessments used in…
Descriptors: Test Items, Computer Assisted Testing, Item Analysis, Difficulty Level
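One simple way to picture position effects of this kind is to let effective item difficulty drift with an item's position in the form. An illustrative sketch under that assumption (not the authors' model):

    # Illustrative position effect: effective Rasch difficulty increases linearly
    # with position (e.g., fatigue), so the same item is harder when it appears late.
    import math

    def prob_correct(theta, b, position, drift_per_position=0.02):
        b_effective = b + drift_per_position * position
        return 1.0 / (1.0 + math.exp(-(theta - b_effective)))

    print(prob_correct(0.0, 0.0, position=1))    # early in the form
    print(prob_correct(0.0, 0.0, position=40))   # same item, administered late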
Ames, Allison; Smith, Elizabeth – Journal of Educational Measurement, 2018
Bayesian methods incorporate model parameter information prior to data collection. Eliciting information from content experts is an option, but has seen little implementation in Bayesian item response theory (IRT) modeling. This study aims to use ethical reasoning content experts to elicit prior information and incorporate this information into…
Descriptors: Item Response Theory, Bayesian Statistics, Ethics, Specialists
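As one illustration of how elicited expert judgments can be turned into priors for IRT item parameters (an assumed translation scheme, not necessarily the study's procedure): experts' estimates of the proportion of examinees expected to answer correctly can be mapped to the logit scale to center a prior on item difficulty.

    # Illustrative elicitation: experts estimate the proportion answering an item
    # correctly; the (negated) logits of those estimates give a prior mean and
    # spread for the item's difficulty. Not necessarily the study's scheme.
    import math
    import statistics

    def difficulty_prior_from_experts(expert_p_correct):
        logits = [-math.log(p / (1.0 - p)) for p in expert_p_correct]
        mu = statistics.mean(logits)
        sigma = statistics.stdev(logits) if len(logits) > 1 else 1.0
        return mu, sigma   # e.g., use as a Normal(mu, sigma) prior in a Bayesian IRT fit

    print(difficulty_prior_from_experts([0.6, 0.7, 0.65]))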
Zhang, Jinming; Li, Jie – Journal of Educational Measurement, 2016
An IRT-based sequential procedure is developed to monitor items for enhancing test security. The procedure uses a series of statistical hypothesis tests to examine whether the statistical characteristics of each item under inspection have changed significantly during CAT administration. This procedure is compared with a previously developed…
Descriptors: Computer Assisted Testing, Test Items, Difficulty Level, Item Response Theory
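A minimal illustration of the general idea of statistically monitoring an item during CAT administration: compare observed correctness with the model-predicted probability and flag large drift. This simple running z-type check is only illustrative; the paper's sequential procedure is more elaborate.

    # Illustrative item monitoring: accumulate residuals between observed responses
    # and IRT-predicted probabilities; a large standardized cumulative residual
    # suggests the item's statistical behavior has changed (e.g., item exposure).
    import math

    def monitor_item(observed, predicted, threshold=3.0):
        """observed: list of 0/1 responses; predicted: model probabilities."""
        resid = sum(o - p for o, p in zip(observed, predicted))
        var = sum(p * (1.0 - p) for p in predicted)
        z = resid / math.sqrt(var) if var > 0 else 0.0
        return z, abs(z) > threshold   # flag the item if drift is large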
DeMars, Christine E. – Journal of Educational Measurement, 2006
Four item response theory (IRT) models were compared using data from tests where multiple items were grouped into testlets focused on a common stimulus. In the bi-factor model each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model the slopes in the direction of the testlet…
Descriptors: Item Response Theory, Reliability, Item Analysis, Factor Analysis
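For reference, the response function being compared: in a bi-factor model each item loads on the primary trait and on its testlet's nuisance trait. A minimal sketch of the standard bi-factor 2PL form (parameter names are illustrative):

    # Bi-factor 2PL response probability: each item has a slope on the primary
    # trait (a_primary) and on its testlet's nuisance trait (a_testlet). The
    # testlet-effects model referenced in the abstract constrains the testlet
    # slopes relative to the primary slopes rather than estimating them freely.
    import math

    def bifactor_prob(theta, gamma_testlet, a_primary, a_testlet, b):
        z = a_primary * theta + a_testlet * gamma_testlet - b
        return 1.0 / (1.0 + math.exp(-z))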

Terwilliger, James S.; Lele, Kaustubh – Journal of Educational Measurement, 1979
Different indices for the internal consistency, reproducibility, or homogeneity of a test are based upon highly similar conceptual frameworks. Illustrations are presented to demonstrate how the maximum and minimum values of KR20 are influenced by test difficulty and the shape of the distribution of test scores. (Author/CTM)
Descriptors: Difficulty Level, Item Analysis, Mathematical Formulas, Statistical Analysis
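The KR-20 coefficient discussed above has the standard form KR20 = k/(k-1) * (1 - sum(p_i * q_i) / total-score variance); its attainable maximum and minimum depend on the item difficulties and the shape of the total-score distribution, which is the point the article illustrates. A minimal sketch:

    # KR-20 from a 0/1 response matrix.
    import numpy as np

    def kr20(responses):
        """responses: examinees x items matrix of 0/1 scores."""
        responses = np.asarray(responses, dtype=float)
        k = responses.shape[1]
        p = responses.mean(axis=0)
        item_var = (p * (1.0 - p)).sum()               # sum of p_i * q_i
        total_var = responses.sum(axis=1).var(ddof=1)  # variance of total scores
        return (k / (k - 1.0)) * (1.0 - item_var / total_var)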

Huck, Schuyler W.; Bowers, Norman D. – Journal of Educational Measurement, 1972
Study investigated whether the proportion of examinees who answer an item correctly may be influenced by the difficulty of the immediately preceding item. (Authors/MB)
Descriptors: Achievement Tests, Difficulty Level, Hypothesis Testing, Item Analysis

Dorans, Neil J. – Journal of Educational Measurement, 1986
The analytical decomposition demonstrates how the effects of item characteristics, test properties, individual examinee responses, and rounding rules combine to produce the item deletion effect on the equating/scaling function and candidate scores. The empirical portion of the report illustrates the effects of item deletion on reported score…
Descriptors: Difficulty Level, Equated Scores, Item Analysis, Latent Trait Theory
A Closer Look at Using Judgments of Item Difficulty to Change Answers on Computerized Adaptive Tests
Vispoel, Walter P.; Clough, Sara J.; Bleiler, Timothy – Journal of Educational Measurement, 2005
Recent studies have shown that restricting review and answer change opportunities on computerized adaptive tests (CATs) to items within successive blocks reduces time spent in review, satisfies most examinees' desires for review, and controls against distortion in proficiency estimates resulting from intentional incorrect answering of items prior…
Descriptors: Mathematics, Item Analysis, Adaptive Testing, Computer Assisted Testing

Mehrens, William A.; Phillips, S. E. – Journal of Educational Measurement, 1987
A taxonomic matrix classification was used to assess the curricular validity of the Stanford Achievement Tests for the mathematics textbooks used in a school district's fifth and sixth grades. Rasch item difficulty was also examined. Results indicated only small differences between textbooks. (GDC)
Descriptors: Difficulty Level, Elementary School Mathematics, Intermediate Grades, Item Analysis

Chalifour, Clark L.; Powers, Donald E. – Journal of Educational Measurement, 1989
Content characteristics of 1,400 Graduate Record Examination (GRE) analytical reasoning items were coded for item difficulty and discrimination. The results provide content characteristics for consideration in extending specifications for analytical reasoning items and a better understanding of the construct validity of these items. (TJH)
Descriptors: College Entrance Examinations, Construct Validity, Content Analysis, Difficulty Level
Beretvas, S. Natasha; Williams, Natasha J. – Journal of Educational Measurement, 2004
To assess item dimensionality, the following two approaches are described and compared: hierarchical generalized linear model (HGLM) and multidimensional item response theory (MIRT) model. Two generating models are used to simulate dichotomous responses to a 17-item test: the unidimensional and compensatory two-dimensional (C2D) models. For C2D…
Descriptors: Item Response Theory, Test Items, Mathematics Tests, Reading Ability
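The compensatory two-dimensional (C2D) generating model referenced above has the standard form in which the two abilities combine additively inside the logit, so a deficit on one dimension can be offset by the other. A minimal sketch:

    # Compensatory 2D IRT: abilities trade off additively.
    import math

    def c2d_prob(theta1, theta2, a1, a2, d):
        z = a1 * theta1 + a2 * theta2 + d
        return 1.0 / (1.0 + math.exp(-z))

    # Low theta1 can be compensated by high theta2:
    print(c2d_prob(-1.0, 1.5, 1.0, 1.0, 0.0))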

Huck, Schuyler W. – Journal of Educational Measurement, 1978
Providing examinees with advance knowledge of the difficulty of an item led to an increase in test performance with no loss of reliability. This finding was consistent across several test formats. (Author/JKS)
Descriptors: Difficulty Level, Feedback, Higher Education, Item Analysis