Showing 181 to 195 of 5,169 results
Peer reviewed
PDF on ERIC
Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023
Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…
Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis
Peer reviewed
PDF on ERIC
Serap Buyukkidik – International Journal of Assessment Tools in Education, 2023
In the current study, differential item functioning (DIF) detection using real data was conducted with the application of "Mantel-Haenszel (MH)", "Simultaneous item bias test (SIBTEST)", "Lord's chi-square", and "Raju's area" methods, both when item purification was carried out and when item purification was…
Descriptors: Language Tests, Test Items, Item Analysis, Gender Differences
Peer reviewed
Direct link
Melanie Ann Weber; Mia Anzilotti; Reece Gormley; Christina Huber; Alyssa McGarvey; Grace McKee; Claire Ogden; Hannah Seinfeld; Julia Wank; Arnold Olszewski – Perspectives of the ASHA Special Interest Groups, 2024
Purpose: Technology, including educational applications (apps), is commonly used in schools by teachers and speech-language pathologists. Nonetheless, very little research has examined the efficacy of these apps for student learning or how to choose appropriate apps for instruction. Several previous rubrics to evaluate the instructional quality of…
Descriptors: Computer Software, Handheld Devices, Educational Technology, Technology Uses in Education
Peer reviewed
Direct link
Kunal Sareen – Innovations in Education and Teaching International, 2024
This study examines the proficiency of ChatGPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…
Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software
Peer reviewed
Direct link
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests that have grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Peer reviewed
Direct link
van der Linden, Wim J.; Choi, Seung W. – Journal of Educational Measurement, 2020
One of the methods of controlling test security in adaptive testing is imposing random item-ineligibility constraints on the selection of the items with probabilities automatically updated to maintain a predetermined upper bound on the exposure rates. Three major improvements of the method are presented. First, a few modifications to improve the…
Descriptors: Adaptive Testing, Item Response Theory, Feedback (Response), Item Analysis
Peer reviewed
Direct link
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2021
This study presents a latent (item response theory--like) framework of a recently developed classical approach to test scoring, equating, and item analysis, referred to as "D"-scoring method. Specifically, (a) person and item parameters are estimated under an item response function model on the "D"-scale (from 0 to 1) using…
Descriptors: Scoring, Equated Scores, Item Analysis, Item Response Theory
Peer reviewed
Direct link
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Peer reviewed
Direct link
Yanchao Yang; Wangze Li; Sijia Xue; Wenxue Huang; Shijie Guo – European Journal of Education, 2025
In response to the prevalence of perceived internship Pick-up Artist (PUA) behaviours and the lack of appropriate measurement tools, the purpose of this study was to develop and validate a new self-designed questionnaire, the Perceived Internship PUA Scale (PIPUAS), to assess college student interns' perceptions of internship PUA behaviours. The…
Descriptors: Measurement Techniques, Incidence, Internship Programs, Validity
Peer reviewed
Direct link
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Peer reviewed
Direct link
Antoinette Y. Farmer; Yuhan Wei; Adrian Gale; N. Andrew Peterson – Journal of Psychoeducational Assessment, 2025
Objective: The factor structure of the Grit-S is the subject of much debate. The purpose of this study was to examine the factor structure of the Grit-S and validate its psychometric properties among racially/ethnically minoritized adolescents using Item Response Theory (IRT). Method: Data were collected from 651 racially/ethnically minoritized…
Descriptors: Item Response Theory, Item Analysis, Self Efficacy, Personality Traits
Peer reviewed
Direct link
Jiayi Deng – Large-scale Assessments in Education, 2025
Background: Test score comparability in international large-scale assessments (LSAs) is greatly important to ensure test fairness. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic versions of test forms into a common score scale. An example is the multigroup…
Descriptors: Guessing (Tests), Item Response Theory, Error Patterns, Arabic
Peer reviewed
Direct link
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
Peer reviewed
Direct link
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, in which each form has equivalent measurement accuracy but a different set of items. For uniform test assembly, an important issue is increasing the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Peer reviewed
PDF on ERIC
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of research is a statistics exam containing problem tasks. One examiner applied two evaluation methods to repeatedly evaluate the exam, with the goal of comparing the methods for objectivity. We call one of the two methods the serial evaluation method, which assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction