ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	33

Descriptor

Test Format	35
Test Items	16
Comparative Analysis	14
Computer Assisted Testing	12
Equated Scores	11
Scores	11
Statistical Analysis	10
Test Construction	10
Item Response Theory	8
Multiple Choice Tests	7
Responses	7
Accuracy	6
Difficulty Level	6
Raw Scores	6
College Entrance Examinations	5
English (Second Language)	5
Error of Measurement	5
Language Tests	5
Mathematics Tests	5
Models	5
Scoring	5
Second Language Learning	5
Simulation	5
Test Reliability	5
Test Validity	5

Source

ETS Research Report Series

Publication Type

Journal Articles	35
Reports - Research	33
Tests/Questionnaires	2
Collected Works - General	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Higher Education	8
Postsecondary Education	8
Secondary Education	3
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1

Location

United States	2
Canada	1
China	1
Colombia	1
Germany	1
India	1
Japan	1
Jordan	1
Mexico	1
New Jersey	1
Pennsylvania	1
South Korea	1
Turkey	1

Assessments and Surveys

SAT (College Admission Test)	3
Test of English as a Foreign…	3
Graduate Record Examinations	2
National Assessment of…	2
Praxis Series	2
ACT Assessment	1
Advanced Placement…	1
College Level Examination…	1
Law School Admission Test	1
Stanford Achievement Tests	1
Test of English for…	1

Showing 1 to 15 of 35 results

Detecting the Impact of Remote Proctored At-Home Testing Using Propensity Score Weighting. Research Report. ETS RR-24-11

Peer reviewed

Jing Miao; Yi Cao; Michael E. Walker – ETS Research Report Series, 2024

Studies of test score comparability have been conducted at different stages in the history of testing to ensure that test results carry the same meaning regardless of test conditions. The expansion of at-home testing via remote proctoring sparked another round of interest. This study uses data from three licensure tests to assess potential mode…

Descriptors: Testing, Test Format, Computer Assisted Testing, Home Study

Best Practices for Constructed-Response Scoring. Research Report. ETS RR-22-17

Peer reviewed

McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022

This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…

Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing

Detecting Test-Taking Engagement in Changing Test Contexts. Research Report. ETS RR-24-09

Peer reviewed

Blair Lehman; Jesse R. Sparks; Jonathan Steinberg – ETS Research Report Series, 2024

Over the last 20 years, many methods have been proposed to use process data (e.g., response time) to detect changes in engagement during the test-taking process. However, many of these methods were developed and evaluated in highly similar testing contexts: 30 or more single-select multiple-choice items presented in a linear, fixed sequence in…

Descriptors: National Competency Tests, Secondary School Mathematics, Secondary School Students, Mathematics Tests

Effect of Statistically Matching Equating Samples for Common-Item Equating. Research Report. ETS RR-21-02

Peer reviewed

Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021

This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…

Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items

Influence of Selected-Response Format Variants on Test Characteristics and Test-Taking Effort: An Empirical Study. Research Report. ETS RR-22-01

Peer reviewed

Guo, Hongwen; Rios, Joseph A.; Ling, Guangming; Wang, Zhen; Gu, Lin; Yang, Zhitong; Liu, Lydia O. – ETS Research Report Series, 2022

Different variants of the selected-response (SR) item type have been developed for various reasons (i.e., simulating realistic situations, examining critical-thinking and/or problem-solving skills). Generally, the variants of SR item format are more complex than the traditional multiple-choice (MC) items, which may be more challenging to test…

Descriptors: Test Format, Test Wiseness, Test Items, Item Response Theory

Does Rearranging Multiple-Choice Item Response Options Affect Item and Test Performance? Research Report. ETS RR-19-02

Peer reviewed

Wang, Lin – ETS Research Report Series, 2019

Rearranging response options in different versions of a test of multiple-choice items can be an effective strategy against cheating on the test. This study investigated if rearranging response options would affect item performance and test score comparability. A study test was assembled as the base version from which 3 variant versions were…

Descriptors: Multiple Choice Tests, Test Items, Test Format, Scores

The Impact of Aberrant Responses and Detection in Forced-Choice Noncognitive Assessment. Research Report. ETS RR-18-32

Peer reviewed

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2018

The purpose of this study is to assess the impact of aberrant responses on the estimation accuracy in forced-choice format assessments. To that end, a wide range of aberrant response behaviors (e.g., fake, random, or mechanical responses) affecting upward of 20%–30% of the responses was manipulated under the multi-unidimensional pairwise…

Descriptors: Measurement Techniques, Response Style (Tests), Accuracy, Computation

Charting the Future of Assessments. Research Report. ETS RR-24-13

Peer reviewed

Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction

Statistical Properties of the GRE® Psychology Test Subscores. ETS GRE® Board Research Report. ETS GRE®-18-02. ETS Research Report. RR-18-19

Peer reviewed

Liu, Yuming; Robin, Frédéric; Yoo, Hanwook; Manna, Venessa – ETS Research Report Series, 2018

The GRE® Psychology test is an achievement test that measures core knowledge in 12 content domains that represent the courses commonly offered at the undergraduate level. Currently, a total score and 2 subscores, experimental and social, are reported to test takers as well as graduate institutions. However, the American Psychological…

Descriptors: College Entrance Examinations, Graduate Study, Psychological Testing, Scores

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The TOEFL® Essentials™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Examining the Accuracy of a Conversation-Based Assessment in Interpreting English Learners' Written Responses. Research Report. ETS RR-21-03

Peer reviewed

Lopez, Alexis A.; Guzman-Orth, Danielle; Zapata-Rivera, Diego; Forsyth, Carolyn M.; Luce, Christine – ETS Research Report Series, 2021

Substantial progress has been made toward applying technology-enhanced conversation-based assessments (CBAs) to measure the English-language proficiency of English learners (ELs). CBAs are conversation-based systems that use conversations among computer-animated agents and a test taker. We expanded the design and capability of prior…

Descriptors: Accuracy, English Language Learners, Language Proficiency, Language Tests

The Measure Matters: Examining Achievement Gaps on Cognitively Demanding Reading and Mathematics Assessments. Policy Information Report and ETS Research Report Series No. RR-19-43

Peer reviewed

Kevelson, Marisol J. C. – ETS Research Report Series, 2019

This study presents estimates of Black-White, Hispanic-White, and income achievement gaps using data from two different types of reading and mathematics assessments: constructed-response assessments that were likely more cognitively demanding and state achievement tests that were likely less cognitively demanding (i.e., composed solely or largely…

Descriptors: Racial Differences, Achievement Gap, White Students, African American Students

ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

Peer reviewed

Moses, Tim – ETS Research Report Series, 2013

The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

Descriptors: Psychometrics, Scores, Correlation, Regression (Statistics)

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Usability of Interactive Item Types and Tools Introduced in the New GRE® Revised General Test. ETS GRE® Board Research Report. ETS GRE®-14-05. ETS Research Report. RR-14-28

Peer reviewed

Swiggett, Wanda D.; Kotloff, Laurie; Ezzo, Chelsea; Adler, Rachel; Oliveri, Maria Elena – ETS Research Report Series, 2014

The computer-based Graduate Record Examinations® (GRE®) revised General Test includes interactive item types and testing environment tools (e.g., test navigation, on-screen calculator, and help). How well do test takers understand these innovations? If test takers do not understand the new item types, these innovations may…

Descriptors: College Entrance Examinations, Graduate Study, Usability, Test Items

Author

Kim, Sooyeon	7
Moses, Tim	4
Walker, Michael E.	3
Liu, Jinghua	2
McHale, Frederick	2
Puhan, Gautam	2
Wang, Lin	2
Wang, Zhen	2
Adler, Rachel	1
Ali, Usama S.	1
Amit Sevak	1
Bivens-Tatum, Jennifer	1
Blair Lehman	1
Boughton, Keith A.	1
Brenneman, Meghan	1
Casabianca, Jodi M.	1
Castellano, Karen	1
Chang, Hua-Hua	1
Chen, Jing	1
Daniel Fishtein	1
Davis, Larry	1
DeCarlo, Lawrence T.	1
Deane, Paul	1
Ezzo, Chelsea	1
Forsyth, Carolyn M.	1