ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	9
Since 2007 (last 20 years)	36

Descriptor

Comparative Analysis	81
Test Bias	81
Test Items	31
Testing Problems	26
Scores	18
Computer Assisted Testing	17
Testing	17
Test Validity	14
Evaluation Methods	13
Adaptive Testing	11
Mathematics Tests	11
Simulation	11
Educational Assessment	10
Elementary School Students	10
Psychometrics	10
English (Second Language)	9
Item Response Theory	9
Standardized Tests	9
Statistical Analysis	9
Test Construction	9
Disabilities	8
Educational Testing	8
Language Tests	8
Scoring	8
Test Format	8
More ▼

Publication Type

Reports - Research	47
Journal Articles	41
Reports - Evaluative	14
Speeches/Meeting Papers	11
Information Analyses	5
Reports - Descriptive	5
Dissertations/Theses -…	4
Guides - General	3
Guides - Non-Classroom	3
Opinion Papers	2
Tests/Questionnaires	2
Collected Works - Serials	1
Numerical/Quantitative Data	1
More ▼

Education Level

Elementary Education	9
Elementary Secondary Education	6
Grade 4	5
Grade 5	4
Grade 8	4
Intermediate Grades	4
Grade 3	3
High Schools	3
Higher Education	3
Secondary Education	3
Grade 7	2
Grade 9	2
Junior High Schools	2
Middle Schools	2
Postsecondary Education	2
Early Childhood Education	1
Grade 11	1
Grade 6	1
Primary Education	1
More ▼

Audience

Researchers	2
Administrators	1
Policymakers	1
Practitioners	1

Location

Australia	2
Canada	2
Arizona	1
Illinois	1
Maryland	1
North Carolina	1
Pennsylvania	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Rehabilitation Act 1973…	1
Social Security	1

Assessments and Surveys

SAT (College Admission Test)	5
Wechsler Intelligence Scale…	5
National Assessment of…	4
Stanford Achievement Tests	3
Program for International…	2
California Achievement Tests	1
Clinical Evaluation of…	1
Comprehensive Tests of Basic…	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
National Teacher Examinations	1
Stanford Diagnostic Reading…	1
Teacher Rating Scale	1
Woodcock Johnson Psycho…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 81 results Save | Export

New Meridian Comparability Review Guidelines. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

Quality Testing Standards and Criteria for Comparability Claims. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

Descriptors: Testing, Standards, Comparative Analysis, Guidelines

"Quality Testing Standards" -- A Starter Kit for States. Version 6.17.2020

Download full text

New Meridian Corporation, 2020

New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…

Descriptors: Testing, Standards, Comparative Analysis, Test Content

Aggregating Polytomous DIF Results over Multiple Test Administrations

Peer reviewed

Direct link

Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018

In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…

Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics

Comparison of Disengagement Levels and the Impact of Disengagement on Item Parameters between PISA 2015 and PISA 2018 in the United States

Peer reviewed

Direct link

Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023

Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…

Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time

Mitigating Gender and L1 Biases in Automated English Speaking Assessment

Direct link

Alexander James Kwako – ProQuest LLC, 2023

Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…

Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics

Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

Direct link

Ayodele, Alicia Nicole – ProQuest LLC, 2017

Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

Descriptors: Statistical Analysis, Test Bias, Test Items, Scores

Mode Comparability Study Based on Spring 2015 Operational Test Data

Download full text

Liu, Junhui; Brown, Terran; Chen, Jianshen; Ali, Usama; Hou, Likun; Costanzo, Kate – Partnership for Assessment of Readiness for College and Careers, 2016

The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium working to develop next-generation assessments that more accurately, compared to previous assessments, measure student progress toward college and career readiness. The PARCC assessments include both English Language Arts/Literacy (ELA/L) and…

Descriptors: Testing, Achievement Tests, Test Items, Test Bias

Multiple True-False Items: A Comparison of Scoring Algorithms

Peer reviewed

Direct link

Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018

Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…

Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed
PDF on ERIC

Download full text

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

The Comparability of Scores from Different Digital Devices: A Literature Review and Synthesis with Recommendations for Practice

Peer reviewed

Direct link

Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018

Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…

Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

Multiple-Group Noncompensatory Differential Item Functioning in Raju's Differential Functioning of Items and Tests

Peer reviewed

Direct link

Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015

Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…

Descriptors: Test Bias, Item Response Theory, Test Items, Simulation

Longitudinal Multistage Testing

Peer reviewed

Direct link

Pohl, Steffi – Journal of Educational Measurement, 2013

This article introduces longitudinal multistage testing (lMST), a special form of multistage testing (MST), as a method for adaptive testing in longitudinal large-scale studies. In lMST designs, test forms of different difficulty levels are used, whereas the values on a pretest determine the routing to these test forms. Since lMST allows for…

Descriptors: Adaptive Testing, Longitudinal Studies, Difficulty Level, Comparative Analysis

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Direct link

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

International Journal of…	5
Journal of Educational…	5
Applied Measurement in…	4
Educational and Psychological…	4
ProQuest LLC	4
ETS Research Report Series	3
New Meridian Corporation	3
Educational Assessment	2
Journal of Applied Testing…	2
Psychology in the Schools	2
Advances in Health Sciences…	1
American Educational Research…	1
Assessing Writing	1
Assessment & Evaluation in…	1
College English	1
ELT Journal	1
Educational Testing Service	1
Evaluation Practice	1
Gifted Child Quarterly	1
Journal of Communication…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Negro Education	1
Journal of Special Education	1
Journal of Technology,…	1
More ▼

Abedi, Jamal	2
Banks, Kathleen	2
Bennett, Randy Elliot	2
Horkay, Nancy, Ed.	2
Hou, Likun	2
Oshima, T. C.	2
Sireci, Stephen G.	2
Steinberg, Jonathan	2
Ahn, Soyeon	1
Alexander James Kwako	1
Ali, Usama	1
Ali, Usama S.	1
Allen, Nancy	1
And Others.	1
Ariel, Adelaide	1
Armstrong, Bill	1
Ayodele, Alicia Nicole	1
Barton, Karen	1
Bauer, Daniel	1
Bennett, Randy Elliott	1
Beretvas, S. Natasha	1
Bleistein, Carole A.	1
Breland, Hunter	1
Brown, Richard S.	1
More ▼