ERIC - Search Results

Publication Date

In 2025	3
Since 2024	17
Since 2021 (last 5 years)	82
Since 2016 (last 10 years)	154
Since 2006 (last 20 years)	212

Descriptor

Scores	289
Test Format	289
Foreign Countries	105
Test Items	100
Comparative Analysis	83
Computer Assisted Testing	82
Language Tests	75
English (Second Language)	68
Second Language Learning	68
Multiple Choice Tests	61
Undergraduate Students	52
Test Construction	44
Correlation	42
Language Proficiency	40
College Students	39
Second Language Instruction	38
Test Reliability	38
Higher Education	35
Statistical Analysis	34
Difficulty Level	32
Student Attitudes	32
Test Validity	32
Item Response Theory	31
Item Analysis	25
Reading Tests	25
More ▼

Publication Type

Journal Articles	289
Reports - Research	236
Reports - Evaluative	35
Tests/Questionnaires	14
Reports - Descriptive	11
Opinion Papers	4
Guides - Non-Classroom	2
Information Analyses	2
Speeches/Meeting Papers	2
Reports - General	1

Education Level

Higher Education	107
Postsecondary Education	92
Secondary Education	26
Elementary Education	16
Middle Schools	15
High Schools	14
Junior High Schools	11
Grade 8	8
Early Childhood Education	6
Grade 3	5
Primary Education	4
Elementary Secondary Education	3
Grade 4	3
Intermediate Grades	3
Adult Education	2
Grade 10	2
Grade 11	2
Grade 2	2
Grade 5	2
Grade 7	2
Grade 9	2
Kindergarten	2
Grade 1	1
Grade 12	1
Grade 6	1
More ▼

Audience

Practitioners	4
Researchers	2
Teachers	2
Administrators	1

Location

Turkey	14
China	12
Iran	7
Australia	4
Germany	4
Indonesia	4
Japan	4
Sweden	4
Israel	3
United Kingdom	3
United Kingdom (England)	3
Canada	2
Ghana	2
Malaysia	2
Netherlands	2
Norway	2
Philippines	2
Russia	2
South Africa	2
Taiwan	2
United States	2
Arizona	1
Asia	1
Belgium	1
California	1
More ▼

Laws, Policies, & Programs

Head Start	1
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 289 results Save | Export

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

Analysis of Mixed-Format Assessments Using Measurement Models and Topic Modeling

Peer reviewed

Direct link

Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025

It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…

Descriptors: Responses, Test Items, Test Format, Grade 8

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Direct link

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed
PDF on ERIC

Download full text

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

From Likert to Forced Choice: Statement Parameter Invariance and Context Effects in Personality Assessment

Peer reviewed

Direct link

Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024

Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…

Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence

Exploring Interaction in Video-Call Paired Speaking Tests: A Look at Scores, Language, and Perceptions

Peer reviewed

Direct link

Hye-won Lee; Andrew Mullooly; Amy Devine; Evelina Galaczi – Applied Linguistics, 2024

In the assessment of second language oral communication, the video-call speaking test has received increasing attention as a test method with higher practicality than its in-person counterpart, but still with broad coverage of the test construct. Previous studies into video-call assessment have focussed on the individual (as opposed to paired or…

Descriptors: Oral Language, Language Skills, Speech Communication, Interaction Process Analysis

Comparing the Effectiveness of Newer Linework on the Mental Cutting Test (MCT) to Investigate Its Delivery in Online Educational Settings

Peer reviewed
PDF on ERIC

Download full text

Green, Theresa; Goodridge, Wade H.; Anderson, Jon; Davishahl, Eric; Kane, Daniel – International Education Studies, 2023

The purpose of this study was to examine any differences in test scores between three different online versions of the Mental Cutting Test (MCT). The MCT was developed to quantify a rotational and proportion construct of spatial ability and has been used extensively to assess spatial ability. This test was developed in 1938 as a paper-and-pencil…

Descriptors: Spatial Ability, Measures (Individuals), Computer Assisted Testing, Test Format

Comparing National Benchmark Quantitative Literacy Paper-Based Tests Scores with Its Computer-Based Counterparts

Peer reviewed
PDF on ERIC

Download full text

Robert N. Prince – Numeracy, 2025

One of the effects of the COVID-19 pandemic was the rapid shift to replacing traditional, paper-based tests with their computer-based counterparts. In many cases, these new modes of delivering tests will remain in place for the foreseeable future. In South Africa, the National Benchmark Quantitative Literacy (QL) test was impelled to make this…

Descriptors: Benchmarking, Numeracy, Multiple Literacies, Paper and Pencil Tests

Pilot Comparison of Reading Quiz Formats in a Graduate Speech Sound Disorders Course

Peer reviewed
PDF on ERIC

Download full text

Sheri Bayley – Teaching and Learning in Communication Sciences & Disorders, 2024

The purpose of this study was to explore student performance, self-ratings of learning and preference, and student comments on a variety of reading quiz formats in a first semester speech-language pathology graduate course. Students from two cohorts (n = 34) completed four types of quizzes: closed-book, open-book, open-note, and collaborative…

Descriptors: Reading Instruction, Tests, Graduate Students, Courses

Does Typing or Handwriting Exam Responses Make Any Difference? Evidence from the Literature

Download full text

Santi Lestari – Research Matters, 2024

Despite the increasing ubiquity of computer-based tests, many general qualifications examinations remain in a paper-based mode. Insufficient and unequal digital provision across schools is often identified as a major barrier to a full adoption of computer-based exams for general qualifications. One way to overcome this barrier is a gradual…

Descriptors: Keyboarding (Data Entry), Handwriting, Test Format, Comparative Analysis

Computer-Delivered vs. Face-to-Face Score Comparability and Test Takers' Perceptions: The Case of the Two English Speaking Proficiency Tests for Vietnamese EFL Learners

Peer reviewed

Direct link

Thuy Ho Hoang Nguyen; Bao Trang Thi Nguyen; Giang Thi Linh Hoang; Nhung Thi Hong Pham; Tu Thi Cam Dang – Language Testing in Asia, 2024

The present study explored the comparability in performance scores between the computer-delivered and face-to-face modes for the two speaking tests in the Vietnamese Standardized Test of English Proficiency (VSTEP) (the VSTEP.2 and VSTEP.3-5 Speaking tests) according to Vietnam's Six-Level Foreign Language Proficiency Framework (VNFLPF) and test…

Descriptors: Test Format, Computer Assisted Testing, Student Attitudes, Language Tests

Score Comparability between Online Proctored and In-Person Credentialing Exams

Peer reviewed

Direct link

Jones, Paul; Tong, Ye; Liu, Jinghua; Borglum, Joshua; Primoli, Vince – Journal of Educational Measurement, 2022

This article studied two methods to detect mode effects in two credentialing exams. In Study 1, we used a "modal scale comparison approach," where the same pool of items was calibrated separately, without transformation, within two TC cohorts (TC1 and TC2) and one OP cohort (OP1) matched on their pool-based scale score distributions. The…

Descriptors: Scores, Credentials, Licensing Examinations (Professions), Computer Assisted Testing

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Direct link

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

A Reliability Generalization Meta-Analysis of Runco Ideational Behavior Scale

Peer reviewed

Direct link

Sen, Sedat – Creativity Research Journal, 2022

The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…

Descriptors: Generalization, Creativity, Meta Analysis, Higher Education

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 20

Educational and Psychological…	14
Language Testing	14
ETS Research Report Series	11
Journal of Educational…	11
Practical Assessment,…	8
Educational Measurement:…	6
Journal of Experimental…	6
Applied Measurement in…	5
Education and Information…	5
Journal of Educational…	5
Language Assessment Quarterly	5
Language Testing in Asia	5
Educational Assessment	4
English Language Teaching	4
International Journal of…	4
CBE - Life Sciences Education	3
Evaluation and the Health…	3
International Journal of…	3
International Journal of…	3
Journal of Economic Education	3
Journal of Education for…	3
Journal of Educational…	3
Psychological Assessment	3
Reading in a Foreign Language	3
Teaching of Psychology	3
More ▼

Aspiranti, Kathleen B.	2
Baghaei, Purya	2
Bulut, Okan	2
Cohen, Allan S.	2
Colliver, Jerry A.	2
Gu, Lin	2
Han, Kyung T.	2
Henze, Erin E. C.	2
Holley, Joyce H.	2
Jenkins, Elizabeth K.	2
Jianbin Fu	2
Katz, Irvin R.	2
Keehner, Madeleine	2
Khoshsima, Hooshang	2
Kim, Sooyeon	2
Lee, Senyung	2
Ling, Guangming	2
McLean, Stuart	2
Moon, Jung Aa	2
Moses, Tim	2
O'Grady, Stefan	2
Pae, Hye K.	2
Patrick C. Kyllonen	2
Plake, Barbara S.	2
More ▼

Test of English as a Foreign…	13
International English…	6
Program for International…	4
Test of English for…	4
Graduate Record Examinations	3
ACT Assessment	2
Minnesota Multiphasic…	2
National Assessment of…	2
SAT (College Admission Test)	2
ACTFL Oral Proficiency…	1
Academic Motivation Scale	1
Bem Sex Role Inventory	1
Computer Attitude Scale	1
Delaware Student Testing…	1
Foreign Language Classroom…	1
General Educational…	1
Law School Admission Test	1
Learning Style Inventory	1
Medical College Admission Test	1
Peabody Individual…	1
Peabody Picture Vocabulary…	1
Phonological Awareness…	1
Praxis Series	1
Preliminary Scholastic…	1
Raven Progressive Matrices	1
More ▼