ERIC - Search Results

Publication Date

In 2026	0
Since 2025	14
Since 2022 (last 5 years)	45
Since 2017 (last 10 years)	126
Since 2007 (last 20 years)	175

Descriptor

Difficulty Level	279
Test Items	279
Test Reliability	279
Test Validity	137
Test Construction	115
Foreign Countries	103
Item Response Theory	72
Multiple Choice Tests	70
Item Analysis	69
Psychometrics	47
Scores	42
Science Tests	34
Higher Education	31
Undergraduate Students	30
Correlation	28
Test Format	28
Achievement Tests	27
Comparative Analysis	27
High School Students	27
Mathematics Tests	25
Statistical Analysis	25
Elementary School Students	23
Scientific Concepts	23
Thinking Skills	22
Factor Analysis	21
More ▼

Publication Type

Reports - Research	230
Journal Articles	187
Speeches/Meeting Papers	29
Reports - Evaluative	23
Tests/Questionnaires	21
Reports - Descriptive	9
Dissertations/Theses -…	7
Numerical/Quantitative Data	5
Guides - Non-Classroom	2
Opinion Papers	2
Collected Works - Serials	1
Computer Programs	1
Guides - General	1
Information Analyses	1
Reports - General	1
More ▼

Education Level

Higher Education	61
Postsecondary Education	55
Secondary Education	52
Elementary Education	40
High Schools	25
Middle Schools	23
Junior High Schools	16
Early Childhood Education	10
Intermediate Grades	10
Primary Education	10
Grade 7	9
Grade 8	8
Kindergarten	8
Elementary Secondary Education	7
Grade 1	7
Grade 6	6
Grade 2	5
Grade 5	4
Grade 9	4
Grade 10	2
Grade 12	2
Grade 3	2
Grade 4	2
Grade 11	1
Preschool Education	1
More ▼

Audience

Researchers	7
Practitioners	2
Teachers	2

Location

Indonesia	21
Turkey	16
Germany	8
Florida	7
Canada	4
Japan	4
Nigeria	4
South Korea	4
United States	4
Australia	3
China	3
India	3
New York	3
Turkey (Istanbul)	3
Colorado	2
Georgia	2
Idaho	2
Indiana	2
Iran	2
Jordan	2
Norway	2
Taiwan	2
Thailand	2
Turkey (Ankara)	2
United Kingdom	2
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 279 results Save | Export

Seeking the Real Reliability: Why the Traditional Estimators of Reliability Usually Fail in Achievement Testing and Why the Deflation-Corrected Coefficients Could Be Better Options

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023

Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…

Descriptors: Test Reliability, Achievement Tests, Computation, Test Items

Comparative Evaluation of C-Test Reliability Using Classical and Modern Psychometric Methods

Peer reviewed
PDF on ERIC

Download full text

Neda Kianinezhad; Mohsen Kianinezhad – Language Education & Assessment, 2025

This study presents a comparative analysis of classical reliability measures, including Cronbach's alpha, test-retest, and parallel forms reliability, alongside modern psychometric methods such as the Rasch model and Mokken scaling, to evaluate the reliability of C-tests in language proficiency assessment. Utilizing data from 150 participants…

Descriptors: Psychometrics, Test Reliability, Language Proficiency, Language Tests

Validation of an Elicited Imitation Test as a Measure of Korean Language Proficiency

Peer reviewed

Direct link

Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024

This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…

Descriptors: Korean, Test Validity, Test Reliability, Imitation

Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?

Peer reviewed

Direct link

Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025

To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…

Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory

Inventory of Galilean Transformation of Uniform Linear Motion in Position-Time Graphs

Peer reviewed

Direct link

E.?B. Merki; S.?I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025

When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…

Descriptors: Graphs, Motion, Physics, Secondary School Students

Improvised Progressive Model Based on Automatic Calibration of Difficulty Level: A Practical Solution of Competitive-Based Examination

Peer reviewed

Direct link

Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024

Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…

Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction

The Influence of Representations on Task Difficulty in Organic Chemistry: An Exploration Using a Novel Paired-Items Test Instrument

Peer reviewed

Direct link

Martin Steinbach; Carolin Eitemüller; Marc Rodemer; Maik Walpuski – International Journal of Science Education, 2025

The intricate relationship between representational competence and content knowledge in organic chemistry has been widely debated, and the ways in which representations contribute to task difficulty, particularly in assessment, remain unclear. This paper presents a multiple-choice test instrument for assessing individuals' knowledge of fundamental…

Descriptors: Organic Chemistry, Difficulty Level, Multiple Choice Tests, Fundamental Concepts

Assessment of Item and Test Parameters: Cosine Similarity Approach

Peer reviewed
PDF on ERIC

Download full text

Chakrabartty, Satyendra Nath – International Journal of Psychology and Educational Studies, 2021

The paper proposes new measures of difficulty and discriminating values of binary items and test consisting of such items and find their relationships including estimation of test error variance and thereby the test reliability, as per definition using cosine similarities. The measures use entire data. Difficulty value of test and item is defined…

Descriptors: Test Items, Difficulty Level, Scores, Test Reliability

Assessing Lower-Secondary School Students' Critical Thinking Skills in Photosynthesis: A Rasch Model Approach

Peer reviewed
PDF on ERIC

Download full text

Suwita Suwita; Sulistyo Saputro; Sajidan Sajidan; Sutarno Sutarno – Journal of Baltic Science Education, 2024

The current study uses the Rasch Model to measure lower-secondary school students' critical thinking skills on photosynthesis topics. Critical thinking skills are considered essential in science education, but few valid and practical measurement instruments remain. The current study fills the gap by adapting the instrument from the Watson-Glaser…

Descriptors: Secondary School Students, Critical Thinking, Thinking Skills, Botany

A Novel Examination of None-of-the-Above as It Influences Examinee Item Responses

Direct link

Thompson, Kathryn N. – ProQuest LLC, 2023

It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…

Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores

Developing and Validating a Biological System Thinking Test for Middle School Students

Peer reviewed

Direct link

Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025

Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…

Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests

Validity and Reliability Analysis of a Socioscientific Issues-Based Critical Thinking Self-Assessment Instrument Using the Rasch Model

Peer reviewed
PDF on ERIC

Download full text

Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025

Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool is still questionable, so a more objective evaluation is needed. Objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…

Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity

Measuring Up: Rasch Analysis of English Reading Comprehension Test for Informal Education Learners

Peer reviewed
PDF on ERIC

Download full text

Arandha May Rachmawati; Agus Widyantoro – English Language Teaching Educational Journal, 2025

This study aims to evaluate the quality of English reading comprehension test instruments used in informal learning, especially as English literacy tests. With a quantitative approach, the analysis was carried out using the Rasch model through the Quest program on 30 multiple-choice questions given to 30 grade IX students from informal educational…

Descriptors: Item Response Theory, Reading Tests, Reading Comprehension, English (Second Language)

Design, Development, and Evaluation of the Organic Chemistry Representational Competence Assessment (ORCA)

Peer reviewed

Direct link

Lyniesha Ward; Fridah Rotich; Jeffrey R. Raker; Regis Komperda; Sachin Nedungadi; Maia Popova – Chemistry Education Research and Practice, 2025

This paper describes the design and evaluation of the Organic chemistry Representational Competence Assessment (ORCA). Grounded in Kozma and Russell's representational competence framework, the ORCA measures the learner's ability to "interpret," "translate," and "use" six commonly used representations of molecular…

Descriptors: Organic Chemistry, Science Tests, Test Construction, Student Evaluation

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

Educational and Psychological…	10
Online Submission	9
Journal of Educational…	8
ProQuest LLC	7
Grantee Submission	5
Journal of Experimental…	5
Physical Review Physics…	5
Applied Measurement in…	4
International Journal of…	4
International Journal of…	4
Journal of Turkish Science…	4
SAGE Open	4
Applied Psychological…	3
Chemistry Education Research…	3
ETS Research Report Series	3
International Journal of…	3
International Journal of…	3
International Journal of…	3
Practical Assessment,…	3
Advances in Health Sciences…	2
Assessment for Effective…	2
Behavioral Research and…	2
CBE - Life Sciences Education	2
Cypriot Journal of…	2
Educational Assessment	2
More ▼

Schoen, Robert C.	6
DiLuzio, Geneva J.	4
Yang, Xiaotong	4
Anderson, Daniel	3
Huck, Schuyler W.	3
Paek, Insu	3
Thompson, Bruce	3
Weiten, Wayne	3
Alexander, Patricia A.	2
Alonzo, Julie	2
Bauduin, Charity	2
Cliff, Norman	2
Feldt, Leonard S.	2
Frisbie, David A.	2
Henning, Grant	2
Istiyono, Edi	2
Lee, Young-Sun	2
Liu, Sicong	2
Loyd, Brenda H.	2
Metsämuuronen, Jari	2
Mike Stieff	2
Perez, Kathryn E.	2
Petscher, Yaacov	2
Pollock, Steven J.	2
More ▼

Test of English as a Foreign…	3
Flesch Kincaid Grade Level…	2
Raven Progressive Matrices	2
SAT (College Admission Test)	2
Test of English for…	2
Adult Attachment Interview	1
Armed Services Vocational…	1
Bayley Scales of Infant…	1
Child Behavior Checklist	1
Comprehensive Tests of Basic…	1
Defining Issues Test	1
Dynamic Indicators of Basic…	1
Embedded Figures Test	1
Flesch Reading Ease Formula	1
Graduate Management Admission…	1
Graduate Record Examinations	1
Hidden Figures Test	1
Iowa Tests of Basic Skills	1
Matching Familiar Figures Test	1
Measures of Academic Progress	1
Metropolitan Achievement Tests	1
Peabody Developmental Motor…	1
Praxis Series	1
Stanford Achievement Tests	1
Stanford Binet Intelligence…	1
More ▼