ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	14

Descriptor

Comparative Analysis	27
Scoring	27
Test Format	27
Computer Assisted Testing	14
Test Items	12
Test Construction	8
Testing	8
Multiple Choice Tests	6
Foreign Countries	5
Higher Education	5
Adaptive Testing	4
English (Second Language)	4
Item Analysis	4
Language Tests	4
Psychometrics	4
Test Reliability	4
Test Validity	4
Testing Problems	4
Achievement Tests	3
College Students	3
Essay Tests	3
Grading	3
Reaction Time	3
Statistical Analysis	3
Test Interpretation	3
More ▼

Source

Computers & Education	2
ETS Research Report Series	2
ACT Education Corp.	1
Applied Measurement in…	1
Arizona Department of…	1
Education Policy Analysis…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Educational…	1
Journal of Psychoeducational…	1
Language Testing	1
Large-scale Assessments in…	1
National Academies Press	1
ProQuest LLC	1
Teaching of Psychology	1
More ▼

Publication Type

Reports - Research	16
Journal Articles	14
Reports - Evaluative	5
Speeches/Meeting Papers	5
Reports - Descriptive	3
Books	2
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	3
Elementary Secondary Education	2
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Grade 8	1
High Schools	1

Audience

Practitioners	2
Teachers	2

Location

Arizona	1
Europe	1
Hungary	1
Malawi	1
Maryland	1
United Arab Emirates	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Graduate Record Examinations	1
International English…	1
National Assessment of…	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 27 results Save | Export

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Direct link

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Download full text

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

A UEA Standardized Test and IELTS vis-à-vis International English Standards

Peer reviewed
PDF on ERIC

Download full text

Al Habbash, Maha; Alsheikh, Negmeldin; Liu, Xu; Al Mohammedi, Najah; Al Othali, Safa; Ismail, Sadiq Abdulwahed – International Journal of Instruction, 2021

This convergent mixed method study aimed at exploring the English context of the widely used Emirates Standardized Test (EmSAT) by juxtaposing it to its sequel, the International English Language Testing System (IELTS). For this purpose, the study used the Common European Framework of Reference (CEFR) international standards which is used as a…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Guidelines

A Pragmatic Future for NAEP: Containing Costs and Updating Technologies. Consensus Study Report

Peer reviewed
PDF on ERIC

Download full text

Direct link

National Academies Press, 2022

The National Assessment of Educational Progress (NAEP) -- often called "The Nation's Report Card" -- is the largest nationally representative and continuing assessment of what students in public and private schools in the United States know and can do in various subjects and has provided policy makers and the public with invaluable…

Descriptors: Costs, Futures (of Society), National Competency Tests, Educational Trends

Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items

Peer reviewed

Direct link

Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017

Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…

Descriptors: Automation, Scoring, Comparative Analysis, Test Items

Internet Administration of the Paper-and-Pencil Gifted Rating Scale: Assessing Psychometric Equivalence

Peer reviewed

Direct link

Yarnell, Jordy B.; Pfeiffer, Steven I. – Journal of Psychoeducational Assessment, 2015

The present study examined the psychometric equivalence of administering a computer-based version of the Gifted Rating Scale (GRS) compared with the traditional paper-and-pencil GRS-School Form (GRS-S). The GRS-S is a teacher-completed rating scale used in gifted assessment. The GRS-Electronic Form provides an alternative method of administering…

Descriptors: Gifted, Psychometrics, Rating Scales, Computer Assisted Testing

An Investigation on Computer-Adaptive Multistage Testing Panels for Multidimensional Assessment

Direct link

Wang, Xinrui – ProQuest LLC, 2013

The computer-adaptive multistage testing (ca-MST) has been developed as an alternative to computerized adaptive testing (CAT), and been increasingly adopted in large-scale assessments. Current research and practice only focus on ca-MST panels for credentialing purposes. The ca-MST test mode, therefore, is designed to gauge a single scale. The…

Descriptors: Computer Assisted Testing, Adaptive Testing, Diagnostic Tests, Comparative Analysis

An Item-Driven Adaptive Design for Calibrating Pretest Items. Research Report. ETS RR-14-38

Peer reviewed
PDF on ERIC

Download full text

Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014

Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…

Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items

Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

Peer reviewed

Direct link

Pellicer-Sanchez, Ana; Schmitt, Norbert – Language Testing, 2012

Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

Descriptors: Language Tests, Scoring, Reaction Time, Vocabulary Development

Comparison of Oral Examination and Electronic Examination Using Paired Multiple-Choice Questions

Peer reviewed

Direct link

Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2011

The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method against the oral examination (OE) method. MCQs are widely used and their importance seems likely to grow, due to their inherent suitability for electronic assessment. However, MCQs are influenced by the tendency of examinees to guess…

Descriptors: Grades (Scholastic), Scoring, Multiple Choice Tests, Test Format

Studies of a Latent-Class Signal-Detection Model for Constructed-Response Scoring. Research Report. ETS RR-08-63

Peer reviewed
PDF on ERIC

Download full text

DeCarlo, Lawrence T. – ETS Research Report Series, 2008

Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…

Descriptors: Scoring, Responses, Test Format, Bias

Comparison of Examination Methods Based on Multiple-Choice Questions and Constructed-Response Questions Using Personal Computers

Peer reviewed

Direct link

Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2010

The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method, to the examination based on constructed-response questions (CRQs). Despite that MCQs have an advantage concerning objectivity in the grading process and speed in production of results, they also introduce an error in the final…

Descriptors: Computer Assisted Instruction, Scoring, Grading, Comparative Analysis

The Contribution of Constructed Response Items to Large Scale Assessment: Measuring and Understanding Their Impact

Peer reviewed

Direct link

Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012

This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…

Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics

The GRE Psychology Test: A Useful but Poorly Understood Test.

Peer reviewed

Kalat, James W.; Matlin, Margaret W. – Teaching of Psychology, 2000

Provides an overview of the Graduate Record Examination (GRE) Psychology test focusing on the scoring system for the GRE Psychology test, who prepares the test and how the test is prepared, and the usefulness of the GRE Psychology test. Explores some future directions for the test. (CMK)

Descriptors: Comparative Analysis, Grade Point Average, Graduate Study, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Anderson, Paul S.	2
Stergiopoulos, Charalampos	2
Triantis, Dimos	2
Tsiakas, Panagiotis	2
Ventouras, Errikos	2
Al Habbash, Maha	1
Al Mohammedi, Najah	1
Al Othali, Safa	1
Ali, Usama S.	1
Alsheikh, Negmeldin	1
Ann Arthur	1
Baldwin, Peter	1
Boyer, Michelle	1
Chakwera, Elias	1
Chang, Hua-Hua	1
Chase, Clinton I.	1
Chen Qiu	1
Chi-Yu Huang	1
Clauser, Brian E.	1
DeCarlo, Lawrence T.	1
Dongmei Li	1
Goldhammer, Frank	1
Harrison, Scott	1
Henk, William A.	1
More ▼