ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	11
Since 2006 (last 20 years)	19

Source

Applied Measurement in…

Publication Type

Journal Articles	19
Reports - Research	17
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Elementary Education	19
Middle Schools	12
Secondary Education	12
Junior High Schools	11
Grade 8	9
Grade 6	7
Grade 7	7
Intermediate Grades	7
Elementary Secondary Education	6
Grade 3	6
Grade 4	6
Grade 5	6
Early Childhood Education	4
Primary Education	4
Grade 2	3
Grade 9	3
High Schools	3
Grade 1	2
More ▼

Audience

Location

Australia	2
Belgium	1
California	1
Colorado	1
Finland	1
Florida	1
Germany	1
Iran (Tehran)	1
Italy	1
Netherlands	1
New York	1
North Carolina	1
Romania	1
Russia	1
Tennessee	1
Texas	1
United Kingdom (Northern…	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	2
Measures of Academic Progress	1
Progress in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

A Method of Empirical Q-Matrix Validation for Multidimensional Item Response Theory

Peer reviewed

Direct link

Marcelo Andrade da Silva; A. Corinne Huggins-Manley; Jorge Luis Bazán; Amber Benedict – Applied Measurement in Education, 2024

A Q-matrix is a binary matrix that defines the relationship between items and latent variables and is widely used in diagnostic classification models (DCMs), and can also be adopted in multidimensional item response theory (MIRT) models. The construction process of the Q-matrix is typically carried out by experts in the subject area of the items…

Descriptors: Q Methodology, Matrices, Item Response Theory, Educational Assessment

Can Adaptive Testing Improve Test-Taking Experience? A Case Study on Educational Survey Assessment

Peer reviewed

Direct link

Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024

Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…

Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format

Leveraging Item Parameter Drift to Assess Transfer Effects in Vocabulary Learning

Peer reviewed

Direct link

Joshua B. Gilbert; James S. Kim; Luke W. Miratrix – Applied Measurement in Education, 2024

Longitudinal models typically emphasize between-person predictors of change but ignore how growth varies "within" persons because each person contributes only one data point at each time. In contrast, modeling growth with multi-item assessments allows evaluation of how relative item performance may shift over time. While traditionally…

Descriptors: Vocabulary Development, Item Response Theory, Test Items, Student Development

A Validation Argument from Soup to Nuts: Assessing Progress on Learning Trajectories for Middle-School Mathematics

Peer reviewed

Direct link

Confrey, Jere; Toutkoushian, Emily; Shah, Meetal – Applied Measurement in Education, 2019

Fully articulating validation arguments in the context of classroom assessment requires connecting evidence from multiple sources and addressing multiple types of validity in a coherent chain of reasoning. This type of validation argument is particularly complex for assessments that function in close proximity to instruction, address the fine…

Descriptors: Test Validity, Item Response Theory, Middle School Students, Mathematics Instruction

Performance Decline as an Indicator of Generalized Test-Taking Disengagement

Peer reviewed

Direct link

Wise, Steven L.; Kingsbury, G. Gage – Applied Measurement in Education, 2022

In achievement testing we assume that students will demonstrate their maximum performance as they encounter test items. Sometimes, however, student performance can decline during a test event, which implies that the test score does not represent maximum performance. This study describes a method for identifying significant performance decline and…

Descriptors: Achievement Tests, Performance, Classification, Guessing (Tests)

Differential Item Functioning for Accommodated Students with Disabilities: Effect of Differences in Proficiency Distributions

Peer reviewed

Direct link

Quesen, Sarah; Lane, Suzanne – Applied Measurement in Education, 2019

This study examined the effect of similar vs. dissimilar proficiency distributions on uniform DIF detection on a statewide eighth grade mathematics assessment. Results from the similar- and dissimilar-ability reference groups with an SWD focal group were compared for four models: logistic regression, hierarchical generalized linear model (HGLM),…

Descriptors: Test Items, Mathematics Tests, Grade 8, Item Response Theory

An IRT Mixture Model for Rating Scale Confusion Associated with Negatively Worded Items in Measures of Social-Emotional Learning

Peer reviewed

Direct link

Bolt, Daniel; Wang, Yang Caroline; Meyer, Robert H.; Pier, Libby – Applied Measurement in Education, 2020

We illustrate the application of mixture IRT models to evaluate respondent confusion due to the negative wording of certain items on a social-emotional learning (SEL) assessment. Using actual student self-report ratings on four social-emotional learning scales collected from students in grades 3-12 from CORE Districts in the state of California,…

Descriptors: Item Response Theory, Social Emotional Learning, Self Evaluation (Individuals), Measurement Techniques

Exploring the Robustness of a Unidimensional Item Response Theory Model with Empirically Multidimensional Data

Peer reviewed

Direct link

Anderson, Daniel; Kahn, Joshua D.; Tindal, Gerald – Applied Measurement in Education, 2017

Unidimensionality and local independence are two common assumptions of item response theory. The former implies that all items measure a common latent trait, while the latter implies that responses are independent, conditional on respondents' location on the latent trait. Yet, few tests are truly unidimensional. Unmodeled dimensions may result in…

Descriptors: Robustness (Statistics), Item Response Theory, Mathematics Tests, Grade 6

Negative Keying Effects in the Factor Structure of TIMSS 2011 Motivation Scales and Associations with Reading Achievement

Peer reviewed

Direct link

Michaelides, Michalis P. – Applied Measurement in Education, 2019

The Student Background survey administered along with achievement tests in studies of the International Association for the Evaluation of Educational Achievement includes scales of student motivation, competence, and attitudes toward mathematics and science. The scales consist of positively- and negatively keyed items. The current research…

Descriptors: International Assessment, Achievement Tests, Mathematics Achievement, Mathematics Tests

Requiring a Consistent Unit of Scale between the Responses of Students and Judges in Standard Setting

Peer reviewed

Direct link

Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014

One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory

Diagnosing Competency Mastery in Science: An Application of GDM to TIMSS 2011 Data

Peer reviewed

Direct link

Kabiri, Masoud; Ghazi-Tabatabaei, Mahmood; Bazargan, Abbas; Shokoohi-Yekta, Mohsen; Kharrazi, Kamal – Applied Measurement in Education, 2017

Numerous diagnostic studies have been conducted on large-scale assessments to illustrate the students' mastery profile in the areas of math and reading; however, for science a limited number of investigations are reported. This study investigated Iranian eighth graders' competency mastery of science and examined the utility of the General…

Descriptors: Elementary Secondary Education, Achievement Tests, International Assessment, Foreign Countries

Conceptualizing and Measuring Computer and Information Literacy in Cross-National Contexts

Peer reviewed

Direct link

Ainley, John; Fraillon, Julian; Schulz, Wolfram; Gebhardt, Eveline – Applied Measurement in Education, 2016

The development of information technologies has transformed the environment in which young people access, create, and share information. Many countries, having recognized the imperative of digital technology, acknowledge the need to educate young people in the use of these technologies so as to underpin economic and social benefits. This article…

Descriptors: Cross Cultural Studies, Information Literacy, Computer Literacy, Grade 8

A Comparison of Teacher Effectiveness Measures Calculated Using Three Multilevel Models for Raters Effects

Peer reviewed

Direct link

Murphy, Daniel L.; Beretvas, S. Natasha – Applied Measurement in Education, 2015

This study examines the use of cross-classified random effects models (CCrem) and cross-classified multiple membership random effects models (CCMMrem) to model rater bias and estimate teacher effectiveness. Effect estimates are compared using CTT versus item response theory (IRT) scaling methods and three models (i.e., conventional multilevel…

Descriptors: Teacher Effectiveness, Comparative Analysis, Hierarchical Linear Modeling, Test Theory

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Peer reviewed

Direct link

Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs

Measuring Mastery across Grades: An Application to Spelling Ability

Peer reviewed

Direct link

Van Nijlen, Daniel; Janssen, Rianne – Applied Measurement in Education, 2011

The distinction between quantitative and qualitative differences in mastery is essential when monitoring student progress and is crucial for instructional interventions to deal with learning difficulties. Mixture item response theory (IRT) models can provide a convenient way to make the distinction between quantitative and qualitative differences…

Descriptors: Spelling, Indo European Languages, Vowels, Verbal Tests

Previous Page | Next Page »

Pages: 1 | 2

Item Response Theory	17
Grade 8	8
Mathematics Tests	8
Test Items	8
Grade 7	7
Elementary School Students	6
Foreign Countries	6
Grade 4	6
Grade 6	6
Grade 3	5
Achievement Tests	4
Grade 5	4
Computation	3
Computer Assisted Testing	3
Correlation	3
Elementary Secondary Education	3
Grade 2	3
Grade 9	3
Mathematics Achievement	3
Measurement	3
Monte Carlo Methods	3
Reading Tests	3
Test Bias	3
Test Format	3
Testing Accommodations	3
More ▼

A. Corinne Huggins-Manley	1
Ainley, John	1
Albano, Anthony D.	1
Amber Benedict	1
Anderson, Daniel	1
Andrich, David	1
Bazargan, Abbas	1
Beretvas, S. Natasha	1
Bolt, Daniel	1
Cho, Hyun-Jeong	1
Confrey, Jere	1
Fraillon, Julian	1
Gebhardt, Eveline	1
Ghazi-Tabatabaei, Mahmood	1
Heldsinger, Sandra	1
Henly, George A.	1
Hickendorff, Marian	1
Humphry, Stephen	1
Ito, Kyoko	1
James S. Kim	1
Janssen, Rianne	1
Jorge Luis Bazán	1
Joshua B. Gilbert	1
Kabiri, Masoud	1
Kahn, Joshua D.	1
More ▼