ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	10

Descriptor

Item Analysis	12
Item Response Theory	12
Testing Programs	12
Test Items	7
Evaluation Methods	4
Test Construction	4
Mathematics Tests	3
Models	3
Standardized Tests	3
Test Validity	3
Achievement Tests	2
Adaptive Testing	2
Comparative Analysis	2
Computer Assisted Testing	2
Difficulty Level	2
Elementary Secondary Education	2
Equated Scores	2
Error of Measurement	2
Grade 11	2
Group Testing	2
High School Students	2
Psychometrics	2
State Programs	2
State Standards	2
Ability Grouping	1

Source

Applied Measurement in…	3
Journal of Applied Testing…	2
Journal of Educational…	2
ACT, Inc.	1
Educational and Psychological…	1
Elementary School Journal	1
Journal of Educational and…	1

Publication Type

Journal Articles	10
Reports - Research	7
Reports - Evaluative	5
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 1	1
Grade 11	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
High Schools	1
Higher Education	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Primary Education	1

Location

Florida	1
Singapore	1

Assessments and Surveys

Graduate Record Examinations	1
National Assessment of…	1

Showing all 12 results

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Semisupervised Learning Method to Adjust Biased Item Difficulty Estimates Caused by Nonignorable Missingness in a Virtual Learning Environment

Peer reviewed

Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022

In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…

Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Multilevel Modeling of Item Position Effects

Peer reviewed

Albano, Anthony D. – Journal of Educational Measurement, 2013

In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Nonparametric Item Response Curve Estimation with Correction for Measurement Error

Peer reviewed

Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011

Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…

Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement

Applying Multidimensional Item Response Theory Models in Validating Test Dimensionality: An Example of K-12 Large-Scale Science Assessment

Peer reviewed

Li, Ying; Jiao, Hong; Lissitz, Robert W. – Journal of Applied Testing Technology, 2012

This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…

Descriptors: Achievement Tests, Science Tests, Item Response Theory, Measures (Individuals)

Design of a Computer-Adaptive Test to Measure English Literacy and Numeracy in the Singapore Workforce: Considerations, Benefits, and Implications

Peer reviewed

Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011

A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…

Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

Identification of Reading Problems in First Grade within a Response-to-Intervention Framework

Peer reviewed

Speece, Deborah L.; Schatschneider, Christopher; Silverman, Rebecca; Case, Lisa Pericola; Cooper, David H.; Jacobs, Dawn M. – Elementary School Journal, 2011

Models of Response to Intervention (RTI) include parameters of assessment and instruction. This study focuses on assessment with the purpose of developing a screening battery that validly and efficiently identifies first-grade children at risk for reading problems. In an RTI model, these children would be candidates for early intervention. We…

Descriptors: Reading Difficulties, Early Intervention, Grade 1, Response to Intervention

Integrating Cognitive and Psychometric Models to Measure Document Literacy.

Peer reviewed

Sheehan, Kathleen; Mislevy, Robert J. – Journal of Educational Measurement, 1990

The 63 items on skills in acquiring and using information from written documents contained in the Survey of Young Adult Literacy in the 1985 National Assessment of Educational Progress are analyzed. The analyses are based on a qualitative cognitive model and an item-response theory model. (TJH)

Descriptors: Adult Literacy, Cognitive Processes, Diagnostic Tests, Elementary Secondary Education

Procedures for Scaling the 1990 Edition of the Nevada Proficiency Examinations in Reading and Mathematics.

Klein, Thomas W. – 1991

Steps involved in the item analysis and scaling of the 1990 edition of Forms A and B of the Nevada High School Proficiency Examinations (NHSPEs) are described. Pilot tests of Forms A and B of the 47-item reading and 45-item mathematics tests were each administered to random samples of more than 600 eleventh-grade students. A computer program was…

Descriptors: Achievement Tests, Cutting Scores, Grade 11, High School Students

Author

Albano, Anthony D.	2
Ackermann, Richard	1
Brian F. French	1
Case, Lisa Pericola	1
Chen, Hanwei	1
Cooper, David H.	1
Cui, Zhongmin	1
Eguez, Jane	1
Ganguli, Debalina	1
Gao, Xiaohong	1
Guo, Hongwen	1
Huggins-Manley, Anne Corinne	1
Jacobs, Dawn M.	1
Jacobsen, Jared	1
Jiao, Hong	1
Klein, Thomas W.	1
Leite, Walter	1
Li, Ying	1
Lissitz, Robert W.	1
Mislevy, Robert J.	1
Phillips, Gary W.	1
Rickard, Patricia	1
Schatschneider, Christopher	1
Sheehan, Kathleen	1
Silverman, Rebecca	1