ERIC - Search Results

Showing 1 to 15 of 379 results

An Investigation of the Nature and Consequence of the Relationship between IRT Difficulty and Discrimination

Peer reviewed

Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022

The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…

Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level

Identifying Response Styles Using Person Fit Analysis and Response-Styles Models

Peer reviewed

Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023

In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…

Descriptors: Goodness of Fit, Responses, Likert Scales, Models

Using ACER ConQuest Program to Examine Multidimensional and Many-Facet Models

Peer reviewed

Mahmut Sami Koyuncu; Mehmet Sata – International Journal of Assessment Tools in Education, 2023

The main aim of this study was to introduce the ConQuest program, which is used in the analysis of multivariate and multidimensional data structures, and to show its applications on example data structures. To achieve this goal, a basic research approach was applied. Thus, how to use the ConQuest program and how to prepare the data set for…

Descriptors: Data Analysis, Computer Oriented Programs, Models, Test Items

An Approach to Test Equating under the Latent "D"-Scoring Method

Peer reviewed

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021

This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…

Descriptors: Equated Scores, Scoring, Test Items, Accuracy

Comparing the Score Interpretation across Modes in PISA: An Investigation of How Item Facets Affect Difficulty

Peer reviewed

Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023

Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…

Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries

Developing a Systems Thinking Skills Assessment for Upper Primary Students in Thailand

Peer reviewed

Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025

Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…

Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

The Knowledge of Autism Questionnaire-UK: Development and Initial Psychometric Evaluation

Peer reviewed

Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025

Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…

Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics

Peer reviewed

Cui-Yan Hoe; Chieh-Yu Chen; Ching-I Chen – Infants and Young Children, 2025

The Ages and Stages Questionnaires: Social-Emotional, Second Edition (ASQ:SE-2) has been translated into Traditional Chinese (ASQ:SE-2-TC) in Taiwan. This study investigated whether the ASQ:SE-2-TC is also suitable for use in Malaysian Chinese families, and if any cultural differences are presented in ASQ:SE-2-TC items. This study analyzed the…

Descriptors: Social Emotional Learning, Child Development, Screening Tests, Item Analysis

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Evaluating the Effectiveness of a Computerized Achievement Test Using Learn Smart for Psychometric Assessment under Item Response Theory

Peer reviewed

Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025

This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…

Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory

Development of a Four-Tier Diagnostic Test for Misconceptions in Natural Science of Primary School Pupils

Peer reviewed

Anatri Desstya; Ika Candra Sayekti; Muhammad Abduh; Sukartono – Journal of Turkish Science Education, 2025

This study aimed to develop a standardised instrument for diagnosing science misconceptions in primary school children. Following a developmental research approach using the 4-D model (Define, Design, Develop, Disseminate), 100 four-tier multiple choice items were constructed. Content validity was established through expert evaluation by six…

Descriptors: Test Construction, Science Tests, Science Instruction, Diagnostic Tests

Deep-IRT with Independent Student and Item Networks

Peer reviewed

Tsutsumi, Emiko; Kinoshita, Ryo; Ueno, Maomi – International Educational Data Mining Society, 2021

Knowledge tracing (KT), the task of tracking the knowledge state of each student over time, has been assessed actively by artificial intelligence researchers. Recent reports have described that Deep-IRT, which combines Item Response Theory (IRT) with a deep learning model, provides superior performance. It can express the abilities of each student…

Descriptors: Item Response Theory, Prediction, Accuracy, Artificial Intelligence

Taming the APA Style Writing Beast: Outcomes from a Structured Workshop for Newly Enrolled MSW Students

Peer reviewed

Stan L. Bowie; Darrell R. Walsh – Journal of Teaching in Social Work, 2024

The study examined (1) the extent of APA Style writing knowledge and understanding among a purposive sample (N = 118) of incoming MSW students; (2) determined the impact of a structured workshop on their level of APA knowledge; and (3) examined the influence of undergraduate academic major on level of knowledge and understanding of APA Style…

Descriptors: Writing Workshops, Counselor Training, Guides, Masters Programs

Development of Ecology Achievement Test for Secondary School Students

Peer reviewed

Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024

This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…

Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
