ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	8

Source

Applied Measurement in…

Author

Albano, Anthony D.	1
Beretvas, S. Natasha	1
Cho, Hyun-Jeong	1
Janssen, Rianne	1
Kingsbury, G. Gage	1
Kingston, Neal	1
Kolen, Michael J.	1
Lee, Jaehoon	1
Lee, Yoonsun	1
Michaelides, Michalis P.	1
Murphy, Daniel L.	1
Taylor, Catherine S.	1
Tong, Ye	1
Van Nijlen, Daniel	1
Wise, Steven L.	1
Wyse, Adam E.	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	7
Reports - Evaluative	1

Education Level

Grade 4	8
Elementary Education	6
Grade 5	6
Grade 6	6
Grade 7	6
Grade 3	5
Intermediate Grades	5
Elementary Secondary Education	4
Grade 8	4
Junior High Schools	4
Middle Schools	4
Secondary Education	3
Early Childhood Education	2
Primary Education	2
Grade 10	1
Grade 2	1
Grade 9	1
High Schools	1
More ▼

Audience

Location

Belgium	1
Colorado	1
Finland	1
Florida	1
Germany	1
Italy	1
New York	1
North Carolina	1
Romania	1
Russia	1
Tennessee	1
Texas	1
United Kingdom (Northern…	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Measures of Academic Progress	1
Progress in International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Performance Decline as an Indicator of Generalized Test-Taking Disengagement

Peer reviewed

Direct link

Wise, Steven L.; Kingsbury, G. Gage – Applied Measurement in Education, 2022

In achievement testing we assume that students will demonstrate their maximum performance as they encounter test items. Sometimes, however, student performance can decline during a test event, which implies that the test score does not represent maximum performance. This study describes a method for identifying significant performance decline and…

Descriptors: Achievement Tests, Performance, Classification, Guessing (Tests)

Negative Keying Effects in the Factor Structure of TIMSS 2011 Motivation Scales and Associations with Reading Achievement

Peer reviewed

Direct link

Michaelides, Michalis P. – Applied Measurement in Education, 2019

The Student Background survey administered along with achievement tests in studies of the International Association for the Evaluation of Educational Achievement includes scales of student motivation, competence, and attitudes toward mathematics and science. The scales consist of positively- and negatively keyed items. The current research…

Descriptors: International Assessment, Achievement Tests, Mathematics Achievement, Mathematics Tests

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Direct link

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Measuring Mastery across Grades: An Application to Spelling Ability

Peer reviewed

Direct link

Van Nijlen, Daniel; Janssen, Rianne – Applied Measurement in Education, 2011

The distinction between quantitative and qualitative differences in mastery is essential when monitoring student progress and is crucial for instructional interventions to deal with learning difficulties. Mixture item response theory (IRT) models can provide a convenient way to make the distinction between quantitative and qualitative differences…

Descriptors: Spelling, Indo European Languages, Vowels, Verbal Tests

Examining the Effectiveness of Test Accommodation Using DIF and a Mixture IRT Model

Peer reviewed

Direct link

Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012

This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…

Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity

Stability of Rasch Scales over Time

Peer reviewed

Direct link

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2010

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…

Descriptors: Measures (Individuals), Item Response Theory, Robustness (Statistics), Item Analysis

Comparisons of Methodologies and Results in Vertical Scaling for Educational Achievement Tests

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael J. – Applied Measurement in Education, 2007

A number of vertical scaling methodologies were examined in this article. Scaling variations included data collection design, scaling method, item response theory (IRT) scoring procedure, and proficiency estimation method. Vertical scales were developed for Grade 3 through Grade 8 for 4 content areas and 9 simulated datasets. A total of 11 scaling…

Descriptors: Achievement Tests, Scaling, Methods, Item Response Theory

Item Response Theory	6
Grade 4	5
Grade 5	4
Grade 6	4
Grade 7	4
Achievement Tests	3
Grade 3	3
Grade 8	3
Mathematics Tests	3
Test Items	3
Elementary School Students	2
Foreign Countries	2
Item Analysis	2
Mathematics Achievement	2
Reading Achievement	2
Reading Tests	2
Scaling	2
Student Motivation	2
Test Bias	2
Testing Accommodations	2
Academic Ability	1
Adaptive Testing	1
Affective Behavior	1
Alternative Assessment	1
Barriers	1
More ▼