ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	24
Since 2006 (last 20 years)	55

Descriptor

Item Analysis	92
Test Items	92
Test Construction	35
Item Response Theory	21
Test Validity	19
Computer Assisted Testing	17
Difficulty Level	17
Multiple Choice Tests	14
Test Reliability	14
Elementary Secondary Education	12
Foreign Countries	12
Mathematics Tests	12
Scoring	12
Computer Software	11
Psychometrics	11
Achievement Tests	10
Educational Assessment	10
Models	10
Higher Education	9
Scores	9
Test Bias	9
Classification	8
Comparative Analysis	8
Statistical Analysis	8
Test Theory	8
More ▼

Publication Type

Reports - Descriptive	92
Journal Articles	60
Numerical/Quantitative Data	7
Speeches/Meeting Papers	7
Tests/Questionnaires	6
Guides - Non-Classroom	5
Opinion Papers	3
Reports - Research	3
Collected Works - General	2
Computer Programs	2
Historical Materials	1
More ▼

Education Level

Elementary Secondary Education	14
Elementary Education	10
Grade 4	8
Higher Education	8
Grade 6	7
Grade 8	7
Middle Schools	7
Grade 5	6
Grade 7	5
Intermediate Grades	5
Junior High Schools	5
Secondary Education	5
Grade 3	4
High Schools	4
Postsecondary Education	4
Early Childhood Education	3
Primary Education	3
Grade 9	2
Adult Education	1
Grade 12	1
Kindergarten	1
More ▼

Audience

Teachers	6
Researchers	4
Practitioners	3
Administrators	2
Counselors	1

Location

Australia	2
California	2
Massachusetts	2
Puerto Rico	2
China	1
Czech Republic	1
Georgia	1
Idaho	1
Italy	1
Japan	1
Malaysia	1
Maryland	1
Missouri	1
New Jersey	1
Oregon	1
Switzerland	1
Turkey	1
United Kingdom (Edinburgh)	1
Washington	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
National Defense Education Act	1

Assessments and Surveys

Trends in International…	2
Eysenck Personality Inventory	1
Graduate Record Examinations	1
Hopkins Symptom Checklist	1
National Assessment of…	1
New Jersey High School…	1
Program for International…	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 92 results Save | Export

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

Improving Mathematics Diagnostic Tests Using Item Analysis

Peer reviewed

Direct link

Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024

Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…

Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis

Essentials of Visual Diagnosis of Test Items. Logical, Illogical, and Anomalous Patterns in Tests Items to Be Detected

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

This article discusses visual techniques for detecting test items that would be optimal to be selected to the final compilation on the one hand and, on the other hand, to out-select those items that would lower the quality of the compilation. Some classic visual tools are discussed, first, in a practical manner in diagnosing the logical,…

Descriptors: Test Items, Item Analysis, Item Response Theory, Cutting Scores

Using Full-Information Item Analysis to Improve Item Quality

Peer reviewed

Direct link

Haladyna, Thomas M.; Rodriguez, Michael C. – Educational Assessment, 2021

Full-information item analysis provides item developers and reviewers comprehensive empirical evidence of item quality, including option response frequency, point-biserial index (PBI) for distractors, mean-scores of respondents selecting each option, and option trace lines. The multi-serial index (MSI) is introduced as a more informative…

Descriptors: Test Items, Item Analysis, Reading Tests, Mathematics Tests

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed
PDF on ERIC

Download full text

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

NLP-Based Management of Large Multiple-Choice Test Item Repositories

Peer reviewed
PDF on ERIC

Download full text

Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023

Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…

Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Figure-Based Approach in Creating ChatGPT-4o-Resistant Multiple-Choice Questions for Introductory Biology Courses: An Instructional Guide

Peer reviewed
PDF on ERIC

Download full text

Kyeng Gea Lee; Mark J. Lee; Soo Jung Lee – International Journal of Technology in Education and Science, 2024

Online assessment is an essential part of online education, and if conducted properly, has been found to effectively gauge student learning. Generally, textbased questions have been the cornerstone of online assessment. Recently, however, the emergence of generative artificial intelligence has added a significant challenge to the integrity of…

Descriptors: Artificial Intelligence, Computer Software, Biology, Science Instruction

Harmonizing Depression Measures across Studies: A Tutorial for Data Harmonization

Peer reviewed

Direct link

Zhao, Xin; Coxe, Stefany; Sibley, Margaret H.; Zulauf-McCurdy, Courtney; Pettit, Jeremy W. – Prevention Science, 2023

There has been increasing interest in applying integrative data analysis (IDA) to analyze data across multiple studies to increase sample size and statistical power. Measures of a construct are frequently not consistent across studies. This article provides a tutorial on the complex decisions that occur when conducting harmonization of measures…

Descriptors: Data Analysis, Sample Size, Decision Making, Test Items

Digital Module 08: Foundations of Operational Item Analysis https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Yoo, Hanwook; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2019

Item analysis is an integral part of operational test development and is typically conducted within two popular statistical frameworks: classical test theory (CTT) and item response theory (IRT). In this digital ITEMS module, Hanwook Yoo and Ronald K. Hambleton provide an accessible overview of operational item analysis approaches within these…

Descriptors: Item Analysis, Item Response Theory, Guidelines, Test Construction

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Direct link

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

A Framework to Evaluate Cognitive Complexity in Science Assessments

Download full text

Achieve, Inc., 2019

Assessment is a key lever for educational improvement. Assessments can be used to monitor, signal, and influence science teaching and learning -- provided that they are of high quality, reflect the rigor and intent of academic standards, and elicit meaningful student performances. Since the release of "A Framework for K-12 Science…

Descriptors: Difficulty Level, Evaluation Criteria, Cognitive Processes, Test Items

Easier Said than Done: Rejoinder on Sijtsma and on Green and Yang

Peer reviewed

Direct link

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U. – Educational Measurement: Issues and Practice, 2016

The main points of Sijtsma and Green and Yang in Educational Measurement: Issues and Practice (34, 4) are that reliability, internal consistency, and unidimensionality are distinct and that Cronbach's alpha may be problematic. Neither of these assertions are at odds with Davenport, Davison, Liou, and Love in the same issue. However, many authors…

Descriptors: Educational Assessment, Reliability, Validity, Test Construction

Computerized Adaptive Test (CAT) Applications and Item Response Theory Models for Polytomous Items

Peer reviewed
PDF on ERIC

Download full text

Aybek, Eren Can; Demirtasli, R. Nukhet – International Journal of Research in Education and Science, 2017

This article aims to provide a theoretical framework for computerized adaptive tests (CAT) and item response theory models for polytomous items. Besides that, it aims to introduce the simulation and live CAT software to the related researchers. Computerized adaptive test algorithm, assumptions of item response theory models, nominal response…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items

Evaluating the Unintended Consequences of Assessment Practices: Construct Irrelevance and Construct Underrepresentation

Peer reviewed

Direct link

Spurgeon, Shawn L. – Measurement and Evaluation in Counseling and Development, 2017

Construct irrelevance (CI) and construct underrepresentation (CU) are 2 major threats to validity, yet they are rarely discussed within the counseling literature. This article provides information about the relevance of these threats to internal validity. An illustrative case example will be provided to assist counselors in understanding these…

Descriptors: Construct Validity, Evaluation Criteria, Evaluation Methods, Evaluation Problems

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational Measurement:…	5
International Journal of…	3
Journal of Educational…	3
Journal of Educational and…	3
Practical Assessment,…	3
Psychometrika	3
Achieve, Inc.	2
Educational Assessment	2
Educational and Psychological…	2
International Association for…	2
Journal of Chemical Education	2
Language Testing	2
National Assessment Governing…	2
National Center for Education…	2
New Meridian Corporation	2
Structural Equation Modeling:…	2
Acta Educationis Generalis	1
Applied Psychological…	1
Arithmetic Teacher	1
Astronomy Education Review	1
Behavioral Research and…	1
Collegiate Microcomputer	1
Education Week	1
Educational Technology	1
Evaluation News	1
More ▼

Ahmed, S.	2
Baxter, G. P.	2
Ketterlin-Geller, Leanne R.	2
Liu, Kimy	2
Salvucci, S.	2
Sikali, E.	2
Sloan, M.	2
Waits, T.	2
Abbott, Marilyn L.	1
Acquaye, Rosemary	1
Anderson, Carolyn J.	1
Anna Lucia Paoletti	1
Arora, Alka, Ed.	1
Aybek, Eren Can	1
Bardar, Erin M.	1
Bechger, Timo M.	1
Beddow, Peter A.	1
Berk, Ronald A.	1
Bichi, Ado Abdu	1
Bolsinova, Maria	1
Bradlow, Eric T.	1
Brecher, Kenneth	1
Brown, Annie	1
Burkett, Allan R.	1
More ▼