ERIC - Search Results

Publication Date

In 2026	0
Since 2025	9
Since 2022 (last 5 years)	112
Since 2017 (last 10 years)	216
Since 2007 (last 20 years)	377

Descriptor

Comparative Analysis	598
Item Analysis	598
Test Items	230
Foreign Countries	182
Scores	103
Item Response Theory	98
Statistical Analysis	97
Correlation	93
Test Construction	86
Factor Analysis	83
Difficulty Level	80
Models	63
Student Attitudes	62
Test Reliability	60
Test Validity	57
Measures (Individuals)	56
Achievement Tests	55
English (Second Language)	55
Second Language Learning	54
Evaluation Methods	52
Computer Assisted Testing	51
Questionnaires	51
Multiple Choice Tests	50
Psychometrics	50
Language Tests	49
More ▼

Education Level

Higher Education	107
Postsecondary Education	90
Secondary Education	73
Elementary Education	46
Elementary Secondary Education	27
High Schools	23
Middle Schools	22
Junior High Schools	17
Grade 4	12
Early Childhood Education	10
Grade 8	10
Intermediate Grades	10
Grade 6	9
Primary Education	8
Grade 5	7
Grade 7	6
Grade 3	5
Adult Education	4
Grade 12	4
Grade 9	4
Kindergarten	4
Preschool Education	4
Grade 1	3
Grade 2	3
Grade 10	2
More ▼

Audience

Researchers	15
Practitioners	4
Teachers	4
Students	2
Policymakers	1

Location

Australia	13
China	13
Germany	13
Turkey	13
Canada	8
United Kingdom	8
United Kingdom (England)	8
United States	8
Indonesia	7
Iran	7
Japan	7
Saudi Arabia	6
South Korea	6
Israel	5
Vietnam	5
Netherlands	4
Spain	4
Thailand	4
Belgium	3
Chile	3
Czech Republic	3
Europe	3
Finland	3
Hong Kong	3
India	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Comparative Analysis X

Showing 76 to 90 of 598 results Save | Export

How Do Physics Students Evaluate Artificial Intelligence Responses on Comprehension Questions? A Study on the Perceived Scientific Accuracy and Linguistic Quality of ChatGPT

Peer reviewed

Direct link

Dahlkemper, Merten Nikolay; Lahme, Simon Zacharias; Klein, Pascal – Physical Review Physics Education Research, 2023

This study aimed at evaluating how students perceive the linguistic quality and scientific accuracy of ChatGPT responses to physics comprehension questions. A total of 102 first- and second-year physics students were confronted with three questions of progressing difficulty from introductory mechanics (rolling motion, waves, and fluid dynamics).…

Descriptors: Physics, Science Instruction, Artificial Intelligence, Computer Software

Mitigating Gender and L1 Biases in Automated English Speaking Assessment

Direct link

Alexander James Kwako – ProQuest LLC, 2023

Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…

Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics

Developing the Diagnostic Test of Misconceptions of Fractions

Peer reviewed
PDF on ERIC

Download full text

Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023

This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…

Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Item-Score Reliability in Empirical-Data Sets and Its Relationship with Other Item Indices

Peer reviewed

Direct link

Zijlmans, Eva A. O.; Tijmstra, Jesper; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2018

Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method [lambda][subscript 6], and method CA. The item-score…

Descriptors: Test Items, Test Reliability, Correlation, Comparative Analysis

Seeing the Forest and the Trees: Comparison of Two IRTree Models to Investigate the Impact of Full versus Endpoint-Only Response Option Labeling

Peer reviewed

Direct link

Spratto, Elisabeth M.; Leventhal, Brian C.; Bandalos, Deborah L. – Educational and Psychological Measurement, 2021

In this study, we examined the results and interpretations produced from two different IRTree models--one using paths consisting of only dichotomous decisions, and one using paths consisting of both dichotomous and polytomous decisions. We used data from two versions of an impulsivity measure. In the first version, all the response options had…

Descriptors: Comparative Analysis, Item Response Theory, Decision Making, Data Analysis

Beyond Group Comparisons: Accounting for Intersectional Sources of Bias in International Survey Measures

Peer reviewed

Direct link

Rujun Xu; James Soland – International Journal of Testing, 2024

International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…

Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries

An Intersectional Approach to DIF: Comparing Outcomes across Methods

Peer reviewed

Direct link

Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022

Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…

Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction

Sensitive Questions in Cross-National Comparative Surveys

Peer reviewed

Direct link

Andreenkova, A. V. – Russian Education & Society, 2019

The article is devoted to the problem of survey items that ask for sensitive information. This factor has a significant impact on the quality and comparability of data from international surveys. We propose a methodology that can be used to comparatively study the level of sensitivity of questions. It is often used in public opinion polls as well…

Descriptors: Cross Cultural Studies, Surveys, Foreign Countries, Classification

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

The Comparison of Item Test Characteristics Viewed from Classic and Modern Test Theory

Peer reviewed
PDF on ERIC

Download full text

Subali, Bambang; Kumaidi; Aminah, Nonoh Siti – International Journal of Instruction, 2021

This research aims at comparing item characteristics of instruments for assessing the level of mastery in scientific method for elementary students as they were analyzed using Classical Test Theory (CTT) and Item Response Theory (IRT). The two analyses are usually done separately, for difference object, in this moment it was analyzed…

Descriptors: Test Items, Item Response Theory, Item Analysis, Comparative Analysis

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Surveying through Gatekeepers in Social Research: Methodological Problems and Suggestions

Peer reviewed

Direct link

Lamprianou, Iasonas – International Journal of Social Research Methodology, 2022

Recruiting participants through gatekeepers has been widely discussed in qualitative research. However, when a sampling frame is not available, surveying through gatekeepers can also be important for quantitative studies. We used three sampling methods to survey guardians of University students: (a) a gatekeeper variant of the time-space sampling,…

Descriptors: Research Problems, Social Science Research, Qualitative Research, Sampling

Deliberate Practice of Spreadsheet Skills When Using Copiable, Randomized, and Auto-Graded Questions within an Interactive Textbook

Peer reviewed
PDF on ERIC

Download full text

Gorbett, Luke J.; Chapamn, Kayla E.; Liberatore, Matthew W. – Advances in Engineering Education, 2022

Spreadsheets are a core computational tool for practicing engineers and engineering students. While Microsoft Excel, Google Sheets, and other spreadsheet tools have some differences, numerous formulas, functions, and other tasks are common across versions and platforms. Building upon learning science frameworks showing that interactive activities…

Descriptors: Spreadsheets, Computer Software, Engineering Education, Textbooks

Pre-Academic Learning Self-Efficacy Revisited: Validation in the Danish Academy Profession Degree Context and Differences across Degree Programs

Peer reviewed

Direct link

Nielsen, Tine – Scandinavian Journal of Educational Research, 2022

The relevance of academic self-efficacy for educational outcomes is well documented. Pre-academic self-efficacy has hardly been studied, and only one study was found to include an assessment of the measurement invariance of the scale used. The aims were to validate the Pre-Academic Learning Self-Efficacy (PAL-SE) scale in a non-university higher…

Descriptors: Self Efficacy, Pandemics, Scores, Foreign Countries

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 40

Educational and Psychological…	36
Journal of Educational…	21
ProQuest LLC	20
ETS Research Report Series	11
Applied Measurement in…	10
Applied Psychological…	10
Language Testing	9
Online Submission	8
Grantee Submission	7
International Journal of…	7
Journal of Educational and…	7
International Educational…	5
International Journal of…	5
International Journal of…	5
Journal of Consulting and…	5
Journal of Experimental…	5
Measurement:…	5
Physical Review Physics…	5
Journal of Experimental…	4
Journal of Psychoeducational…	4
Journal of Speech, Language,…	4
Language Assessment Quarterly	4
Practical Assessment,…	4
Psychometrika	4
Scandinavian Journal of…	4
More ▼

Hambleton, Ronald K.	5
Weiss, David J.	4
Bashaw, W. L.	3
Benson, Jeri	3
Blanton, Maria	3
Facon, Bruno	3
Gongjun Xu	3
Haladyna, Tom	3
Knuth, Eric	3
Lord, Frederic M.	3
Reckase, Mark D.	3
Stephens, Ana	3
Strachota, Susanne	3
Stroud, Rena	3
Stylianou, Despina	3
Vale, C. David	3
Allan S. Cohen	2
Angoff, William H.	2
Ayan, Cansu	2
Baghaei, Purya	2
Bennett, Randy Elliot	2
Bratfisch, Oswald	2
Chun Wang	2
Dawis, Rene V.	2
More ▼

Reports - Research	412
Journal Articles	396
Reports - Evaluative	75
Speeches/Meeting Papers	56
Tests/Questionnaires	29
Dissertations/Theses -…	20
Numerical/Quantitative Data	18
Reports - Descriptive	15
Information Analyses	11
Books	5
Collected Works - General	4
Guides - Non-Classroom	4
Opinion Papers	3
Collected Works - Serials	1
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
Reports - General	1
Translations	1
More ▼

Program for International…	16
SAT (College Admission Test)	11
Trends in International…	8
National Assessment of…	4
Test of English as a Foreign…	4
California Achievement Tests	3
Raven Progressive Matrices	3
Beck Depression Inventory	2
Childrens Manifest Anxiety…	2
Eysenck Personality Inventory	2
Graduate Record Examinations	2
International English…	2
Iowa Tests of Educational…	2
Metropolitan Achievement Tests	2
Minnesota Multiphasic…	2
Peabody Picture Vocabulary…	2
Sequential Tests of…	2
Stanford Binet Intelligence…	2
ACT Assessment	1
Armed Services Vocational…	1
Autism Diagnostic Observation…	1
Bem Sex Role Inventory	1
Bender Gestalt Test	1
Boehm Test of Basic Concepts	1
California Critical Thinking…	1
More ▼