ERIC - Search Results

Publication Date

In 2026	0
Since 2025	10
Since 2022 (last 5 years)	54
Since 2017 (last 10 years)	97
Since 2007 (last 20 years)	163

Descriptor

Test Format	506
Test Validity	506
Test Reliability	243
Test Construction	180
Test Items	127
Foreign Countries	108
Language Tests	96
Higher Education	86
Testing	80
Computer Assisted Testing	72
Test Use	67
Multiple Choice Tests	64
Scores	59
English (Second Language)	58
Second Language Learning	57
Standardized Tests	53
Student Evaluation	53
Test Interpretation	53
Elementary Secondary Education	52
Testing Problems	52
Language Proficiency	49
Comparative Analysis	48
Scoring	47
Test Content	47
Evaluation Methods	46

Education Level

Higher Education	60
Postsecondary Education	50
Secondary Education	30
Elementary Education	25
Middle Schools	19
Junior High Schools	15
High Schools	13
Grade 8	11
Grade 4	9
Elementary Secondary Education	8
Grade 5	8
Grade 3	7
Intermediate Grades	7
Early Childhood Education	6
Grade 6	6
Grade 7	6
Primary Education	5
Adult Education	3
Grade 11	1
Grade 2	1
Grade 9	1
Kindergarten	1
Preschool Education	1

Audience

Practitioners	30
Teachers	19
Administrators	17
Researchers	9
Community	1
Policymakers	1
Students	1
Support Staff	1

Location

Canada	10
China	9
New York	9
Japan	7
Netherlands	6
Germany	5
Turkey	5
United Kingdom	5
United Kingdom (England)	5
Australia	4
Georgia	4
Iran	4
United States	4
Israel	3
New Zealand	3
Indonesia	2
Mexico	2
North Carolina	2
Oregon	2
Singapore	2
South Korea	2
United Kingdom (Great Britain)	2
United Kingdom (Northern…	2
United Kingdom (Wales)	2
Africa	1

Laws, Policies, & Programs

Elementary and Secondary…	1
Individuals with Disabilities…	1
Job Training Partnership Act…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1

Showing 1 to 15 of 506 results

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

A Systematic Review of Differential Item Functioning in Second Language Assessment

Peer reviewed

Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025

The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…

Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis

Real-Life Applications of Competence-Based Test Development to the Construction, Improvement, and Shortening of Tests

Peer reviewed

Pasquale Anselmi; Jürgen Heller; Luca Stefanutti; Egidio Robusto; Giulia Barillari – Education and Information Technologies, 2025

Competence-based test development (CbTD) is a novel method for constructing tests that are as informative as possible about the competence state (the set of skills an individual masters) underlying the item responses. If desired, the tests can also be minimal, meaning that no item can be eliminated without reducing their informativeness. To…

Descriptors: Competency Based Education, Test Construction, Test Length, Usability

Test Review of Iranian English Language Proficiency Test: MSRT Test

Peer reviewed

Ali Khodi; Logendra Stanley Ponniah; Amir Hossein Farrokhi; Fateme Sadeghi – Language Testing in Asia, 2024

The current article evaluates a national English language proficiency test known as the "MSRT test" which is used to determine the eligibility of candidates for admission to and completion of higher education programs in Iran. Students in all majors take this standardized, high-stake criterion-referenced test to determine if they have…

Descriptors: Foreign Countries, Language Tests, Reading Tests, Language Proficiency

Investigating the Role of Response Format in Computer-Based Lecture Comprehension Tasks

Peer reviewed

Stefan O'Grady – International Journal of Listening, 2025

Language assessment is increasingly computer-mediated. This development presents opportunities with new task formats and equally a need for renewed scrutiny of established conventions. Recent recommendations to increase integrated skills assessment in lecture comprehension tests is premised on empirical research that demonstrates enhanced construct…

Descriptors: Language Tests, Lecture Method, Listening Comprehension Tests, Multiple Choice Tests

The MSRT: A Critical Review of English Proficiency in Iran

Peer reviewed

Muhammed Parviz; Masoud Azizi – Discover Education, 2025

This article offers a critical review of the Ministry of Science, Research, and Technology English Proficiency Test (MSRT), a high-stakes exam required for postgraduate graduation, scholarships, and certain employment positions in Iran. Despite its widespread use, the design and implementation of the MSRT raise concerns about its validity and…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

The Effect of Question Positioning on Data Quality in Web Surveys

Peer reviewed

Cornelia Eva Neuert – Sociological Methods & Research, 2024

The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…

Descriptors: Online Surveys, Test Items, Test Format, Test Construction

Selecting Technically Adequate Tests

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Design Framework for the ACT® Enhancements. ACT Research. Research Report. R2519

Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025

This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…

Descriptors: College Entrance Examinations, Testing, Change, Test Construction

New York State Testing Program: Grades 6 and 7 English Language Arts Paper-Based Tests. Teacher's Directions. Spring 2024

New York State Education Department, 2024

The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…

Descriptors: Language Tests, Test Format, Language Arts, English Instruction

Meta²: A Meta-Analysis and Psychometric Evaluation of the Metacognitive Awareness Inventory (MAI) in the Context of Health Professions Education

Peer reviewed

Andrew S. Cale; Elizabeth R. Agosto; Brenda Kucha Anak Ganeng; Megan E. Kruskie; Margaret A. McNulty; Kyle A. Robertson; Cecelia J. Vetter; Sabrina C. Woods; Md. Nazmul Karim; Adam B. Wilson – Anatomical Sciences Education, 2025

To keep pace with medicine's unpredictable changes, medical trainees must learn to accurately monitor and evaluate themselves via metacognition (i.e., thinking about thinking). The Metacognitive Awareness Inventory (MAI) can assess and guide the metacognitive development of trainees. This study summarizes existing psychometric evidence and…

Descriptors: Meta Analysis, Psychometrics, Metacognition, Measures (Individuals)

A Two-Level Adaptive Test Battery

Peer reviewed

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

The DAATS Battery Short Form as a Measure of Teacher Dispositions

Peer reviewed

Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025

The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…

Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies

Improving Student Understanding of Quantum Measurement in Infinite-Dimensional Hilbert Space Using a Research-Based Multiple-Choice Question Sequence

Peer reviewed

Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025

Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…

Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests

Source

Diagnostique	26
Educational and Psychological…	25
Language Testing	14
Journal of Educational…	9
New York State Education…	9
Language Assessment Quarterly	8
Journal of Reading	7
Psychological Assessment	7
ETS Research Report Series	5
Online Submission	5
Applied Psychological…	4
Assessment	4
Assessment for Effective…	4
International Journal of…	4
Journal of Experimental…	4
Journal of Psychoeducational…	4
Perceptual and Motor Skills	4
Applied Measurement in…	3
Assessment in Education:…	3
Canadian Modern Language…	3
Grantee Submission	3
International Journal of…	3
Measurement and Evaluation in…	3
Physical Review Physics…	3
ProQuest LLC	3

Author

Schriesheim, Chester A.	7
Hambleton, Ronald K.	5
Stansfield, Charles W.	5
Benson, Jeri	4
Cheng, Liying	3
Federico, Pat-Anthony	3
Melancon, Janet G.	3
Read, John	3
Silverstein, A. B.	3
Straus, Murray A.	3
Thompson, Bruce	3
Wainer, Howard	3
Alderson, J. Charles	2
Allen, Nancy L.	2
Byrne, Barbara M.	2
Carcelli, Larry	2
Conoyer, Sarah J.	2
Eignor, Daniel R.	2
Green, Kathy	2
Hamby, Sherry L.	2
Hendrickson, Amy	2
Henk, William A.	2
Herman, Joan	2
Huntley, Renee M.	2

Publication Type

Journal Articles	318
Reports - Research	256
Reports - Evaluative	83
Reports - Descriptive	74
Speeches/Meeting Papers	70
Information Analyses	38
Opinion Papers	35
Guides - Non-Classroom	26
Tests/Questionnaires	20
Guides - Classroom - Teacher	9
Guides - General	6
Dissertations/Theses -…	4
Books	3
Collected Works - General	3
Numerical/Quantitative Data	3
Reference Materials -…	3
ERIC Publications	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Guides - Classroom - Learner	1
Non-Print Media	1
Reference Materials - General	1
Reports - General	1

Assessments and Surveys

Test of English as a Foreign…	9
SAT (College Admission Test)	6
International English…	5
Wechsler Adult Intelligence…	5
Beck Depression Inventory	4
Minnesota Multiphasic…	4
National Assessment of…	4
National Teacher Examinations	4
ACT Assessment	3
Armed Services Vocational…	3
Embedded Figures Test	3
Program for International…	3
Stanford Achievement Tests	3
Wechsler Intelligence Scale…	3
Graduate Record Examinations	2
Kaufman Brief Intelligence…	2
Keymath Diagnostic Arithmetic…	2
Peabody Picture Vocabulary…	2
Wechsler Individual…	2
Wechsler Intelligence Scales…	2
Woodcock Reading Mastery Test	2
Armed Forces Qualification…	1
Bar Examinations	1
Behavior Assessment System…	1
California Achievement Tests	1