ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	12

Source

Educational Measurement:…

Publication Type

Journal Articles	12
Reports - Research	5
Reports - Evaluative	4
Reports - Descriptive	3

Education Level

Middle Schools

Showing all 12 results

Digital Module 30: Validity and Educational Testing--Purposes and Uses of Educational Tests

Peer reviewed

Lewis, Jennifer; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2022

This module is designed for educators, educational researchers, and psychometricians who would like to develop an understanding of the basic concepts of validity theory, test validation, and documenting a "validity argument." It also describes how an in-depth understanding of the purposes and uses of educational tests sets the foundation…

Descriptors: Test Validity, Tests, Testing Problems, Faculty Development

An Examination of Classification Accuracy in the Continuous Testing Framework

Peer reviewed

Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021

The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…

Descriptors: Classification, Accuracy, Testing, Failure

Reporting Pass-Fail Decisions to Examinees with Incomplete Data: A Commentary on Feinberg (2021)

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022

Administrative problems such as computer malfunctions and power outages occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…

Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Deficiency, Contamination, and the Signal Processing Metaphor

Peer reviewed

Newton, Paul E. – Educational Measurement: Issues and Practice, 2020

Educational assessment involves eliciting, transmitting, and receiving information concerning the level of proficiency of a learner in a specified domain. With that in mind, it is perhaps surprising that the literature seems to make very little use of the signal processing metaphor. The present article begins by making a general case for greater…

Descriptors: Educational Assessment, Student Evaluation, Evaluative Thinking, Test Validity

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions made when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high-stakes consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelists' ability to perform the Bookmark method well, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

Detecting Measurement Disturbances in Rater-Mediated Assessments

Peer reviewed

Wind, Stefanie A.; Schumacker, Randall E. – Educational Measurement: Issues and Practice, 2017

The term measurement disturbance has been used to describe systematic conditions that affect a measurement process, resulting in a compromised interpretation of person or item estimates. Measurement disturbances have been discussed in relation to systematic response patterns associated with items and persons, such as start-up, plodding, boredom,…

Descriptors: Measurement, Testing Problems, Writing Tests, Performance Based Assessment

A Framework for Policies and Practices to Improve Test Security Programs: Prevention, Detection, Investigation, and Resolution (PDIR)

Peer reviewed

Ferrara, Steve – Educational Measurement: Issues and Practice, 2017

Test security is not an end in itself; it is important because we want to be able to make valid interpretations from test scores. In this article, I propose a framework for comprehensive test security systems: prevention, detection, investigation, and resolution. The article discusses threats to test security, roles and responsibilities, rigorous…

Descriptors: Testing Programs, Educational Practices, Educational Policy, Program Improvement

Examining Estimates of Intervention Effectiveness Using Sensitivity Analysis

Peer reviewed

An, Chen; Braun, Henry; Walsh, Mary E. – Educational Measurement: Issues and Practice, 2018

Making causal inferences from a quasi-experiment is difficult. Sensitivity analysis approaches to address hidden selection bias thus have gained popularity. This study serves as an introduction to a simple but practical form of sensitivity analysis using Monte Carlo simulation procedures. We examine estimated treatment effects for a school-based…

Descriptors: Statistical Inference, Intervention, Program Effectiveness, Quasiexperimental Design

Descriptor

Testing Problems	12
Test Items	4
Test Validity	4
Standard Setting (Scoring)	3
Accuracy	2
Classification	2
Error of Measurement	2
Evaluation Methods	2
Licensing Examinations…	2
Scores	2
Test Bias	2
Test Construction	2
Tests	2
Ability	1
Bilingual Students	1
Capacity Building	1
Change Strategies	1
Credentials	1
Cutting Scores	1
Diagnostic Tests	1
Difficulty Level	1
Educational Assessment	1
Educational Policy	1
Educational Practices	1
Equal Education	1

Author

An, Chen	1
Angela Johnson	1
Babcock, Ben	1
Braun, Henry	1
Coggeshall, Whitney Smiley	1
Elizabeth Barker	1
Ferrara, Steve	1
Grabovsky, Irina	1
Hein, Serge F.	1
Kim, Sooyeon	1
Leventhal, Brian C.	1
Lewis, Jennifer	1
Marcos Viveros Cespedes	1
Newton, Paul E.	1
Schumacker, Randall E.	1
Sinharay, Sandip	1
Sireci, Stephen G.	1
Skaggs, Gary	1
Walker, Michael E.	1
Walsh, Mary E.	1
Wilkins, Jesse L. M.	1
Wind, Stefanie A.	1
Wyse, Adam E.	1