ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	8

Descriptor

Item Response Theory	7
Grade 5	5
Grade 4	4
Grade 6	4
Grade 7	4
Grade 8	4
Test Items	4
Achievement Tests	3
Grade 3	3
Measurement	3
Test Bias	3
Comparative Analysis	2
Computer Assisted Testing	2
Elementary Secondary Education	2
Scaling	2
Science Tests	2
Test Format	2
Testing Accommodations	2
Academic Ability	1
Adaptive Testing	1
Affective Behavior	1
Alternative Assessment	1
Barriers	1
Bias	1
Classification	1
More ▼

Source

Applied Measurement in…

Author

Albano, Anthony D.	1
Beretvas, S. Natasha	1
Cho, Hyun-Jeong	1
Gattamorta, Karina A.	1
Henly, George A.	1
Janssen, Rianne	1
Kingsbury, G. Gage	1
Kingston, Neal	1
Kolen, Michael J.	1
Lee, Jaehoon	1
Murphy, Daniel L.	1
Penfield, Randall D.	1
Tong, Ye	1
Van Nijlen, Daniel	1
Wan, Lei	1
Wise, Steven L.	1
Wyse, Adam E.	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	6
Reports - Evaluative	2

Education Level

Grade 5	8
Elementary Education	6
Grade 4	6
Grade 6	6
Grade 8	6
Grade 3	5
Grade 7	5
Middle Schools	5
Elementary Secondary Education	4
Intermediate Grades	4
Junior High Schools	4
Secondary Education	4
Early Childhood Education	2
High Schools	2
Primary Education	2
Grade 2	1
Grade 9	1
More ▼

Audience

Location

Belgium	1
Colorado	1
Florida	1
New York	1
North Carolina	1
Tennessee	1
Texas	1

Laws, Policies, & Programs

Assessments and Surveys

Measures of Academic Progress

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Performance Decline as an Indicator of Generalized Test-Taking Disengagement

Peer reviewed

Direct link

Wise, Steven L.; Kingsbury, G. Gage – Applied Measurement in Education, 2022

In achievement testing we assume that students will demonstrate their maximum performance as they encounter test items. Sometimes, however, student performance can decline during a test event, which implies that the test score does not represent maximum performance. This study describes a method for identifying significant performance decline and…

Descriptors: Achievement Tests, Performance, Classification, Guessing (Tests)

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Direct link

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

A Comparison of Adjacent Categories and Cumulative Differential Step Functioning Effect Estimators

Peer reviewed

Direct link

Gattamorta, Karina A.; Penfield, Randall D. – Applied Measurement in Education, 2012

The study of measurement invariance in polytomous items that targets individual score levels is known as differential step functioning (DSF). The analysis of DSF requires the creation of a set of dichotomizations of the item response variable. There are two primary approaches for creating the set of dichotomizations to conduct a DSF analysis: the…

Descriptors: Measurement, Item Response Theory, Test Bias, Test Items

Measuring Mastery across Grades: An Application to Spelling Ability

Peer reviewed

Direct link

Van Nijlen, Daniel; Janssen, Rianne – Applied Measurement in Education, 2011

The distinction between quantitative and qualitative differences in mastery is essential when monitoring student progress and is crucial for instructional interventions to deal with learning difficulties. Mixture item response theory (IRT) models can provide a convenient way to make the distinction between quantitative and qualitative differences…

Descriptors: Spelling, Indo European Languages, Vowels, Verbal Tests

Measurement Properties of Two Innovative Item Formats in a Computer-Based Test

Peer reviewed

Direct link

Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012

Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…

Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

Comparisons of Methodologies and Results in Vertical Scaling for Educational Achievement Tests

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael J. – Applied Measurement in Education, 2007

A number of vertical scaling methodologies were examined in this article. Scaling variations included data collection design, scaling method, item response theory (IRT) scoring procedure, and proficiency estimation method. Vertical scales were developed for Grade 3 through Grade 8 for 4 content areas and 9 simulated datasets. A total of 11 scaling…

Descriptors: Achievement Tests, Scaling, Methods, Item Response Theory