Publication Date
  In 2025: 5
  Since 2024: 12
  Since 2021 (last 5 years): 68
  Since 2016 (last 10 years): 139
  Since 2006 (last 20 years): 206
Author
  Reckase, Mark D.: 6
  Roid, Gale: 4
  Cahen, Leonard S.: 3
  Dorans, Neil J.: 3
  Facon, Bruno: 3
  Hambleton, Ronald K.: 3
  Papageorgiou, Spiros: 3
  Plake, Barbara S.: 3
  Prestwood, J. Stephen: 3
  Retnawati, Heri: 3
  Smith, Richard M.: 3
Audience
  Researchers: 33
  Practitioners: 1
Location
  Germany: 7
  Nigeria: 7
  Turkey: 7
  Indonesia: 6
  India: 5
  South Africa: 5
  Canada: 4
  Florida: 4
  Taiwan: 4
  China: 3
  Europe: 3
Laws, Policies, & Programs
  Education Consolidation…: 1
  Elementary and Secondary…: 1
  No Child Left Behind Act 2001: 1
Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022
The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…
Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level
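For orientation only (not taken from the cited paper), the classical-test-theory quantities this abstract refers to can be computed directly from a scored response matrix: item difficulty as the proportion correct and item discrimination as the corrected item-total point-biserial correlation. Function and variable names below are illustrative.

```python
import numpy as np

def classical_item_stats(responses):
    """Classical item difficulty (p-value) and discrimination
    (corrected item-total point-biserial) from a 0/1 matrix
    of shape (n_examinees, n_items)."""
    responses = np.asarray(responses, dtype=float)
    difficulty = responses.mean(axis=0)        # proportion correct per item
    total = responses.sum(axis=1)
    discrimination = []
    for j in range(responses.shape[1]):
        rest = total - responses[:, j]         # total score excluding item j
        discrimination.append(np.corrcoef(responses[:, j], rest)[0, 1])
    return difficulty, np.array(discrimination)

# Toy usage: 5 examinees, 3 items
X = [[1, 0, 1],
     [1, 1, 1],
     [0, 0, 1],
     [1, 0, 0],
     [0, 1, 1]]
p, rpb = classical_item_stats(X)
print(p, rpb)
```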
Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023
In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…
Descriptors: Goodness of Fit, Responses, Likert Scales, Models
Mahmut Sami Koyuncu; Mehmet Sata – International Journal of Assessment Tools in Education, 2023
The main aim of this study was to introduce the ConQuest program, which is used in the analysis of multivariate and multidimensional data structures, and to show its applications on example data structures. To achieve this goal, a basic research approach was applied. Thus, how to use the ConQuest program and how to prepare the data set for…
Descriptors: Data Analysis, Computer Oriented Programs, Models, Test Items
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021
This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the equating was examined via a simulation study under a 3 × 3 design crossing two conditions: group ability (three levels) and test difficulty (three levels). The results for…
Descriptors: Equated Scores, Scoring, Test Items, Accuracy
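The D-scoring equating procedure studied here is the paper's own contribution; as a generic point of reference only, the sketch below shows a standard textbook alternative under a NEAT design, mean-sigma linking of anchor-item difficulty estimates, not DSM-L. All names and values are illustrative.

```python
import numpy as np

def mean_sigma_link(b_anchor_new, b_anchor_ref):
    """Mean-sigma linking constants A, B so that a value x on the
    new scale maps to A * x + B on the reference scale, computed
    from anchor-item difficulties estimated separately in each group."""
    b_new = np.asarray(b_anchor_new, dtype=float)
    b_ref = np.asarray(b_anchor_ref, dtype=float)
    A = b_ref.std(ddof=1) / b_new.std(ddof=1)
    B = b_ref.mean() - A * b_new.mean()
    return A, B

# Toy anchor difficulties estimated in the new and reference groups
A, B = mean_sigma_link([-0.8, 0.1, 0.9], [-0.5, 0.4, 1.3])
theta_new = 0.25                      # an ability estimate on the new scale
print(A, B, A * theta_new + B)        # the same ability on the reference scale
```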
Harrison, Scott; Kroehne, Ulf; Goldhammer, Frank; Lüdtke, Oliver; Robitzsch, Alexander – Large-scale Assessments in Education, 2023
Background: Mode effects, the variations in item and scale properties attributed to the mode of test administration (paper vs. computer), have stimulated research around test equivalence and trend estimation in PISA. The PISA assessment framework provides the backbone to the interpretation of the results of the PISA test scores. However, an…
Descriptors: Scoring, Test Items, Difficulty Level, Foreign Countries
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper uses the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
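As background only (not the exact specification fitted in the paper), a many-facet Rasch model places person ability, situation (item) difficulty, and rater severity on one logit scale; the minimal dichotomous sketch below, with illustrative names, computes the resulting success probability.

```python
import math

def mfrm_probability(theta, delta_item, lambda_rater):
    """Dichotomous many-facet Rasch model: the log-odds of success
    are person ability minus item difficulty minus rater severity."""
    logit = theta - delta_item - lambda_rater
    return 1.0 / (1.0 + math.exp(-logit))

# A moderately able examinee, an easy situation, a severe rater
print(mfrm_probability(theta=0.5, delta_item=-0.3, lambda_rater=0.7))
```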
Tsutsumi, Emiko; Kinoshita, Ryo; Ueno, Maomi – International Educational Data Mining Society, 2021
Knowledge tracing (KT), the task of tracking the knowledge state of each student over time, has been studied actively by artificial intelligence researchers. Recent reports have described that Deep-IRT, which combines Item Response Theory (IRT) with a deep learning model, provides superior performance. It can express the abilities of each student…
Descriptors: Item Response Theory, Prediction, Accuracy, Artificial Intelligence
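Per the abstract, the defining idea of Deep-IRT-style models is to pass network outputs through an IRT response function so ability and difficulty stay interpretable. The sketch below shows only that final step in its simplest (1PL-like) form, with illustrative names; it is not the authors' full architecture.

```python
import math

def irt_output_layer(student_ability, item_difficulty, scale=1.0):
    """IRT-style prediction head: probability of a correct response
    given scalar ability and difficulty (which, in a Deep-IRT-style
    model, would be produced by a neural network)."""
    return 1.0 / (1.0 + math.exp(-scale * (student_ability - item_difficulty)))

print(irt_output_layer(student_ability=0.8, item_difficulty=0.2))
```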
Mimi Ismail; Ahmed Al-Badri; Said Al-Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to examine differences in individuals' ability estimates, their standard errors, and the psychometric properties of the test across the two administration modes (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022
A distinguishing feature of the study is its comprehensive approach to the reliability of language-testing results, which is affected by several functional and variable factors. Contradictory and ambiguous views among researchers on these issues underscore the relevance of the study. The article highlights the problem of equivalence…
Descriptors: Student Evaluation, Language Tests, Test Format, Test Items
Hayat, Bahrul – Cogent Education, 2022
The purpose of this study is threefold: (1) to calibrate the Basic Statistics Test for Indonesian undergraduate psychology students using the Rasch model, (2) to test the impact of adjustment for guessing on item parameters, person parameters, test reliability, and the distribution of item difficulty and person ability, and (3) to compare person scores…
Descriptors: Guessing (Tests), Statistics Education, Undergraduate Students, Psychology
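The paper's specific adjustment for guessing is its own; for reference only, the sketch below shows the classical correction-for-guessing (formula-scoring) rule, which subtracts the expected number of lucky guesses from the number-right score. Names are illustrative.

```python
def guessing_corrected_score(n_right, n_wrong, n_options):
    """Classical correction for guessing: number right minus
    W / (k - 1); omitted items are not penalized."""
    return n_right - n_wrong / (n_options - 1)

# 40 items answered correctly, 12 wrong, 4-option multiple choice
print(guessing_corrected_score(n_right=40, n_wrong=12, n_options=4))  # 36.0
```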
Stan L. Bowie; Darrell R. Walsh – Journal of Teaching in Social Work, 2024
The study (1) examined the extent of APA Style writing knowledge and understanding among a purposive sample (N = 118) of incoming MSW students; (2) determined the impact of a structured workshop on their level of APA knowledge; and (3) examined the influence of undergraduate academic major on level of knowledge and understanding of APA Style…
Descriptors: Writing Workshops, Counselor Training, Guides, Masters Programs
Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024
This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted using an exploratory sequential mixed-methods design, and the study group consisted of 250 middle school students in the sixth and seventh grades. In…
Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
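DeCarlo's item measures are defined in the paper itself; as general background, the sketch below computes only the standard equal-variance Gaussian signal detection quantities, sensitivity (d') and criterion, from hit and false-alarm rates. Names and numbers are illustrative.

```python
from scipy.stats import norm

def sdt_measures(hit_rate, false_alarm_rate):
    """Equal-variance Gaussian SDT: sensitivity d' = z(H) - z(F)
    and criterion c = -(z(H) + z(F)) / 2."""
    zh = norm.ppf(hit_rate)
    zf = norm.ppf(false_alarm_rate)
    return zh - zf, -(zh + zf) / 2.0

# e.g., examinees who know an item answer it correctly 85% of the time,
# while those who do not still answer correctly 30% of the time
d_prime, criterion = sdt_measures(0.85, 0.30)
print(d_prime, criterion)
```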
Ferrari-Bridgers, Franca – International Journal of Listening, 2023
While many tools exist to assess student content knowledge, there are few that assess whether students display the critical listening skills necessary to interpret the quality of a speaker's message at the college level. The following research provides preliminary evidence for the internal consistency and factor structure of a tool, the…
Descriptors: Factor Structure, Test Validity, Community College Students, Test Reliability