Publication Date
| Date Range | Results |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 190 |
| Since 2022 (last 5 years) | 1057 |
| Since 2017 (last 10 years) | 2567 |
| Since 2007 (last 20 years) | 4928 |
Audience
| Audience | Results |
| --- | --- |
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Location | Results |
| --- | --- |
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
What Works Clearinghouse Rating
| Rating | Results |
| --- | --- |
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Yunting Liu; Shreya Bhandari; Zachary A. Pardos – British Journal of Educational Technology, 2025
Effective educational measurement relies heavily on the curation of well-designed item pools. However, item calibration is time consuming and costly, requiring a sufficient number of respondents to estimate the psychometric properties of items. In this study, we explore the potential of six different large language models (LLMs; GPT-3.5, GPT-4,…
Descriptors: Artificial Intelligence, Test Items, Psychometrics, Educational Assessment
Rodrigo Moreta-Herrera; Jacqueline Regatto-Bonifaz; Víctor Viteri-Miranda; María Gorety Rodríguez-Vieira; Giancarlo Magro-Lazo; Jose A. Rodas; Sergio Dominguez-Lara – Journal of Psychoeducational Assessment, 2025
Objective: To analyze the validity evidence for scores on the Academic Procrastination Scale (APS), its measurement equivalence across nationality, the reliability of its scores, and its validity in relation to other variables among university students from Ecuador, Venezuela, and Peru. Method: This paper involves a quantitative, descriptive,…
Descriptors: Measures (Individuals), Time Management, College Students, Foreign Countries
Mohamed Kara-Mohamed – Journal of Educational Technology Systems, 2025
(1) Context: The growing accessibility of Artificial Intelligence (AI) technology, such as ChatGPT, poses a challenge to the integrity of online assessments in higher education. As AI becomes more integrated into academic contexts, educators face the complex task of maintaining assessment standards particularly within modern Virtual Learning…
Descriptors: Artificial Intelligence, Virtual Classrooms, Computer Assisted Testing, Universities
Andreea Dutulescu; Stefan Ruseti; Denis Iorga; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2025
Automated multiple-choice question (MCQ) generation is valuable for scalable assessment and enhanced learning experiences. However, existing MCQ generation methods face challenges in ensuring plausible distractors and maintaining answer consistency. This paper introduces a method for MCQ generation that integrates reasoning-based explanations…
Descriptors: Automation, Computer Assisted Testing, Multiple Choice Tests, Natural Language Processing
Yousef Abdelqader Abu shindi; Muna Abdullah Al-Bahrani – Psychology in the Schools, 2025
The current study examined the psychometric properties of the Career Thoughts Inventory (CTI) and its performance among a sample of 2366 adolescents: 1037 (45.4%) males and 1289 (54.5%) females. Item Response Theory (IRT) was applied to identify which CTI items proficiently contribute to a single proper measurement of CTI. IRT evaluates the amount of…
Descriptors: Adolescents, Foreign Countries, Measures (Individuals), Item Response Theory
Bing Jia; Zhemin Zhu – SAGE Open, 2025
Learning trajectories refer to the progression of students' knowledge acquisition and skill development within a specific domain. By identifying students' proficiency levels across specific knowledge points or attributes, cognitive diagnostic models (CDMs) allow researchers to systematically describe and analyze learning trajectories. This study…
Descriptors: Learning Trajectories, Grade 8, Middle School Mathematics, Achievement Tests
Wesley Morris; Langdon Holmes; Joon Suh Choi; Scott Crossley – International Journal of Artificial Intelligence in Education, 2025
Recent developments in the field of artificial intelligence allow for improved performance in the automated assessment of extended response items in mathematics, potentially allowing for the scoring of these items cheaply and at scale. This study details the grand prize-winning approach to developing large language models (LLMs) to automatically…
Descriptors: Automation, Computer Assisted Testing, Mathematics Tests, Scoring
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
Güler Yavuz Temel – Journal of Educational Measurement, 2024
The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…
Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models
Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024
We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…
Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis
Lin Ma – ProQuest LLC, 2024
This dissertation presents an innovative approach to examining the keying method, wording method, and construct validity on psychometric instruments. By employing a mixed methods explanatory sequential design, the effects of keying and wording in two psychometric assessments were examined and validated. Those two self-report psychometric…
Descriptors: Evaluation, Psychometrics, Measures (Individuals), Instrumentation
Arif Cem Topuz; Kinshuk – Educational Technology Research and Development, 2024
Online assessments of learning, or online exams, have become increasingly widespread with the rise of distance learning. Online exams are preferred by many students and are perceived as a quick and easy tool to measure knowledge. On the contrary, some students are concerned about the possibility of cheating and technological difficulties in online…
Descriptors: Computer Assisted Testing, Student Evaluation, Evaluation Methods, Student Attitudes
Archana Praveen Kumar; Ashalatha Nayak; Manjula Shenoy K.; Chaitanya; Kaustav Ghosh – International Journal of Artificial Intelligence in Education, 2024
Multiple Choice Questions (MCQs) are a popular assessment method because they enable automated evaluation, flexible administration and use with huge groups. Despite these benefits, the manual construction of MCQs is challenging, time-consuming and error-prone. This is because each MCQ is comprised of a question called the "stem", a…
Descriptors: Multiple Choice Tests, Test Construction, Test Items, Semantics
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling

