ERIC - Search Results

Showing 1 to 15 of 29 results

Linking Unlinkable Tests: A Step Forward

Peer reviewed

Silvia Testa; Renato Miceli; Renato Miceli – Educational Measurement: Issues and Practice, 2025

Random Equating (RE) and Heuristic Approach (HA) are two linking procedures that may be used to compare the scores of individuals in two tests that measure the same latent trait, in conditions where there are no common items or individuals. In this study, RE--that may only be used when the individuals taking the two tests come from the same…

Descriptors: Comparative Testing, Heuristics, Problem Solving, Personality Traits

Digital Module 30: Validity and Educational Testing--Purposes and Uses of Educational Tests

Peer reviewed

Lewis, Jennifer; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2022

This module is designed for educators, educational researchers, and psychometricians who would like to develop an understanding of the basic concepts of validity theory, test validation, and documenting a "validity argument." It also describes how an in-depth understanding of the purposes and uses of educational tests sets the foundation…

Descriptors: Test Validity, Tests, Testing Problems, Faculty Development

Measurement Invariance for Multilingual Learners Using Item Response and Response Time in PISA 2018

Peer reviewed

Jung Yeon Park; Sean Joo; Zikun Li; Hyejin Yoon – Educational Measurement: Issues and Practice, 2025

This study examines potential assessment bias based on students' primary language status in PISA 2018. Specifically, multilingual (MLs) and nonmultilingual (non-MLs) students in the United States are compared with regard to their response time as well as scored responses across three cognitive domains (reading, mathematics, and science).…

Descriptors: Achievement Tests, Secondary School Students, International Assessment, Test Bias

An Examination of Classification Accuracy in the Continuous Testing Framework

Peer reviewed

Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021

The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…

Descriptors: Classification, Accuracy, Testing, Failure

Combining Process Information and Item Response Modeling to Estimate Problem-Solving Ability

Peer reviewed

Xiao, Yue; Veldkamp, Bernard; Liu, Hongyun – Educational Measurement: Issues and Practice, 2022

The action sequences of respondents in problem-solving tasks reflect rich and detailed information about their performance, including differences in problem-solving ability, even if item scores are equal. It is therefore not sufficient to infer individual problem-solving skills based solely on item scores. This study is a preliminary attempt to…

Descriptors: Problem Solving, Item Response Theory, Scores, Item Analysis

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Deficiency, Contamination, and the Signal Processing Metaphor

Peer reviewed

Newton, Paul E. – Educational Measurement: Issues and Practice, 2020

Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…

Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

Digital Module 12: Think-Aloud Interviews and Cognitive Labs https://ncme.elevate.commpartners.com

Peer reviewed

Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…

Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes

Assessment for Learning with Diverse Learners in a Digital World

Peer reviewed

DiCerbo, Kristen – Educational Measurement: Issues and Practice, 2020

We have the ability to capture data from students' interactions with digital environments as they engage in learning activity. This provides the potential for a reimagining of assessment to one in which assessment becomes part of our natural education activity and can be used to support learning. These new data allow us to more closely examine the…

Descriptors: Student Diversity, Information Technology, Learning Activities, Learning Processes

Affordances of Item Formats and Their Effects on Test-Taker Cognition under Uncertainty

Peer reviewed

Moon, Jung Aa; Keehner, Madeleine; Katz, Irvin R. – Educational Measurement: Issues and Practice, 2019

The current study investigated how item formats and their inherent affordances influence test-takers' cognition under uncertainty. Adult participants solved content-equivalent math items in multiple-selection multiple-choice and four alternative grid formats. The results indicated that participants' affirmative response tendency (i.e., judge the…

Descriptors: Affordances, Test Items, Test Format, Test Wiseness
