Showing 46 to 60 of 598 results
Peer reviewed
Direct link
Wang, Shaojie; Zhang, Minqiang; Lee, Won-Chan; Huang, Feifei; Li, Zonglong; Li, Yixing; Yu, Sufang – Journal of Educational Measurement, 2022
Traditional IRT characteristic curve linking methods ignore parameter estimation errors, which may undermine the accuracy of estimated linking constants. Two new linking methods are proposed that take into account parameter estimation errors. The item- (IWCC) and test-information-weighted characteristic curve (TWCC) methods employ weighting…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Monte Carlo Methods
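Loosely, characteristic-curve linking chooses constants A and B that minimize a weighted squared distance between old-form item characteristic curves and rescaled new-form curves. A minimal sketch under a 3PL model, using Fisher information as an illustrative weight; the paper's actual weights are designed to reflect parameter estimation error, and all names here are assumptions, not the authors' implementation:

    import numpy as np

    def p_3pl(theta, a, b, c):
        # 3PL response probability with scaling constant D = 1.7
        return c + (1 - c) / (1 + np.exp(-1.7 * a * (theta - b)))

    def item_information(theta, a, b, c):
        # Fisher information of a 3PL item at ability theta
        p = p_3pl(theta, a, b, c)
        return (1.7 * a) ** 2 * ((1 - p) / p) * ((p - c) / (1 - c)) ** 2

    def weighted_cc_loss(A, B, old_items, new_items, thetas):
        # Sum of information-weighted squared differences between old-form
        # curves and new-form curves rescaled by linking constants (A, B)
        loss = 0.0
        for (a_o, b_o, c_o), (a_n, b_n, c_n) in zip(old_items, new_items):
            a_t, b_t = a_n / A, A * b_n + B  # scale transformation
            w = item_information(thetas, a_o, b_o, c_o)  # illustrative weight
            diff = p_3pl(thetas, a_o, b_o, c_o) - p_3pl(thetas, a_t, b_t, c_n)
            loss += np.sum(w * diff ** 2)
        return loss

Minimizing this criterion over A and B (e.g., with scipy.optimize.minimize) yields the linking constants.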
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have often been reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Peer reviewed
Direct link
Salem, Alexandra C.; Gale, Robert; Casilio, Marianne; Fleegle, Mikala; Fergadiotis, Gerasimos; Bedrick, Steven – Journal of Speech, Language, and Hearing Research, 2023
Purpose: ParAlg (Paraphasia Algorithms) is a software tool that automatically categorizes the naming errors (paraphasias) produced by a person with aphasia in relation to the intended target on a picture-naming test. These classifications (based on lexicality as well as semantic, phonological, and morphological similarity to the target) are important for…
Descriptors: Semantics, Computer Software, Aphasia, Classification
Peer reviewed
Direct link
Uminski, Crystal; Hubbard, Joanna K.; Couch, Brian A. – CBE - Life Sciences Education, 2023
Biology instructors use concept assessments in their courses to gauge student understanding of important disciplinary ideas. Instructors can choose to administer concept assessments based on participation (i.e., lower stakes) or the correctness of responses (i.e., higher stakes), and students can complete the assessment in an in-class or…
Descriptors: Biology, Science Tests, High Stakes Tests, Scores
Peer reviewed
Direct link
Liu, Yixing; Thompson, Marilyn S. – Journal of Experimental Education, 2022
A simulation study was conducted to explore the impact of differential item functioning (DIF) on general factor difference estimation for bifactor, ordinal data. Common analysis misspecifications, in which the generated bifactor data with DIF were fitted using models with equality constraints on noninvariant item parameters, were compared under data…
Descriptors: Comparative Analysis, Item Analysis, Sample Size, Error of Measurement
Chun Wang; Ruoyi Zhu; Gongjun Xu – Grantee Submission, 2022
Differential item functioning (DIF) analysis refers to procedures that evaluate whether an item's characteristics differ across groups of persons after controlling for overall differences in performance. DIF is routinely evaluated as a screening step to ensure that items behave the same across groups. Currently, the majority of DIF studies focus…
Descriptors: Models, Item Response Theory, Item Analysis, Comparative Analysis
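For a single dichotomous item, a common screening procedure (not necessarily the one developed in this paper) is the logistic-regression DIF test, which compares a matching-score-only model against one that adds group and group-by-score terms. A minimal sketch, assuming numpy arrays and statsmodels:

    import numpy as np
    import statsmodels.api as sm

    def lr_dif_statistic(item, total, group):
        # item: 0/1 item responses; total: matching scores; group: 0/1 indicator
        m0 = sm.Logit(item, sm.add_constant(total)).fit(disp=0)
        X1 = sm.add_constant(np.column_stack([total, group, total * group]))
        m1 = sm.Logit(item, X1).fit(disp=0)
        # Likelihood-ratio statistic (df = 2) testing uniform + nonuniform DIF
        return 2 * (m1.llf - m0.llf)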
Peer reviewed
PDF on ERIC: Download full text
Marleny Leasa; Mariana Rengkuan; John Rafafy Batlolona – Journal of Education and Learning (EduLearn), 2024
Metacognition is one of the key learning skills in the 21st century, with strong potential to help students succeed in science learning. Until now, lecturers have done little to foster this metacognitive awareness in their teaching. This study aimed to analyze the problem-based learning (PBL) reading-questioning-answering (PBLRQA) model's effect on…
Descriptors: Metacognition, Preservice Teachers, Teacher Education Programs, Academic Achievement
Peer reviewed
Direct link
Beisemann, Marie; Forthmann, Boris; Bürkner, Paul-Christian; Holling, Heinz – Journal of Creative Behavior, 2020
The Remote Associates Test (RAT; Mednick, 1962; Mednick & Mednick, 1967) is a commonly employed test of creative convergent thinking. The RAT is scored dichotomously, with correct answers scored as 1 and all other answers as 0. Based on recent research into the information processing underlying RAT performance, we argued that the…
Descriptors: Psychometrics, Scoring, Tests, Semantics
Peer reviewed
Direct link
Liu, Xiaowen; Jane Rogers, H. – Educational and Psychological Measurement, 2022
Test fairness is critical to the validity of group comparisons involving gender, ethnicities, culture, or treatment conditions. Detection of differential item functioning (DIF) is one component of efforts to ensure test fairness. The current study compared four treatments for items that have been identified as showing DIF: deleting, ignoring,…
Descriptors: Item Analysis, Comparative Analysis, Culture Fair Tests, Test Validity
Peer reviewed
Direct link
Shukla, Vishakha; Long, Madeleine; Bhatia, Vrinda; Rubio-Fernandez, Paula – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
While most research on scalar implicature has focused on the lexical scale "some" vs "all," here we investigated an understudied scale formed by two syntactic constructions: categorizations (e.g., "Wilma is a nurse") and comparisons ("Wilma is like a nurse"). An experimental study by Rubio-Fernandez et al.…
Descriptors: Cues, Pragmatics, Comparative Analysis, Syntax
Peer reviewed
Direct link
Robie, Chet; Meade, Adam W.; Risavy, Stephen D.; Rasheed, Sabah – Educational and Psychological Measurement, 2022
The effects of different response option orders on survey responses have been studied extensively. The typical research design involves examining the differences in response characteristics between conditions with the same item stems and response option orders that differ in valence--either incrementally arranged (e.g., strongly disagree to…
Descriptors: Likert Scales, Psychometrics, Surveys, Responses
Peer reviewed
Direct link
Schaebbicke, Katharina; Seeliger, Heiko; Repp, Sophie – Journal of Psycholinguistic Research, 2021
The goal of this study is to provide better empirical insight into the licensing conditions of a large set of NPIs in German so that they can be used as reliable diagnostics in future research on negation-related phenomena. Experiment 1 tests the acceptability of 60 NPIs under semantic operators that are expected to license superstrong, strong,…
Descriptors: German, Phrase Structure, Semantics, Language Research
Peer reviewed
Direct link
Olsho, Alexis; Smith, Trevor I.; Eaton, Philip; Zimmerman, Charlotte; Boudreaux, Andrew; White Brahmia, Suzanne – Physical Review Physics Education Research, 2023
We developed the Physics Inventory of Quantitative Literacy (PIQL) to assess students' quantitative reasoning in introductory physics contexts. The PIQL includes several "multiple-choice-multiple-response" (MCMR) items (i.e., multiple-choice questions for which more than one response may be selected) as well as traditional single-response…
Descriptors: Multiple Choice Tests, Science Tests, Physics, Measures (Individuals)
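How MCMR responses are scored matters for any psychometric analysis of such items. A minimal sketch contrasting two illustrative options, dichotomous (exact-match) versus proportional partial credit; the PIQL's actual scoring rules may differ:

    def score_mcmr(selected, keyed, n_options, partial=True):
        # selected/keyed: collections of chosen and keyed option indices
        selected, keyed = set(selected), set(keyed)
        if not partial:
            return float(selected == keyed)  # 1 only for an exact match
        # Credit each option whose select/omit decision matches the key
        matches = sum((i in selected) == (i in keyed) for i in range(n_options))
        return matches / n_options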
Peer reviewed
Direct link
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC) and the National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
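As one concrete illustration of placing two boards' scores on a common scale, linear (mean-sigma) equating matches the means and standard deviations of the two score distributions; the study's actual equating design and method may differ:

    import numpy as np

    def mean_sigma_equate(scores_x, scores_y):
        # Returns a function mapping Form X scores onto the Form Y scale
        slope = np.std(scores_y, ddof=1) / np.std(scores_x, ddof=1)
        intercept = np.mean(scores_y) - slope * np.mean(scores_x)
        return lambda x: slope * x + intercept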
Peer reviewed
Direct link
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations