ERIC - Search Results

Showing all 11 results

Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups

Peer reviewed

Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…

Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement

Examining the Impact of Violations of Local Item Independence Assumption on Test Equating Methods

Peer reviewed

Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025

This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…

Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment

Examining Cross-Cultural Applicability via Generalizability Theory

Peer reviewed

Soysal, Sümeyra – Participatory Educational Research, 2023

Applying a measurement instrument developed in one country to other countries raises a critical question, especially in cross-cultural studies. Confirmatory factor analysis (CFA) is the most widely used method for examining the cross-cultural applicability of measurement tools. Although CFA is a sophisticated…

Descriptors: Generalization, Cross Cultural Studies, Measurement Techniques, Factor Analysis

From OLS to Multilevel Multidimensional Mixture IRT: A Model Refinement Approach to Investigating Patterns of Relationships in PISA 2012 Data

Gulsah Gurkan – ProQuest LLC, 2021

Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…

Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Conditioning: How Background Variables Can Influence PISA Scores

Peer reviewed

Zieger, Laura Raffaella; Jerrim, J.; Anders, J.; Shure, N. – Assessment in Education: Principles, Policy & Practice, 2022

The OECD's Programme for International Student Assessment (PISA) has become one of the key studies for evidence-based education policymaking across the globe. PISA has, however, received considerable methodological criticism, including of how the test scores are created. The aim of this paper is to investigate the so-called 'conditioning model', where…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

The Examination of Measurement Invariance and Differential Item Functioning of PISA 2015 Cognitive Tests in Terms of the Commonly Used Languages

Peer reviewed

Sekercioglu, Güçlü; Kogar, Hakan – Novitas-ROYAL (Research on Youth and Language), 2018

The aim of the present study was to examine the measurement invariance (MI) of the reading, mathematics, and science tests in terms of the commonly used languages. It also aimed to examine the differential item functioning (DIF) of the PISA test, the original items of which are in the languages of English and French, in terms of the language…

Descriptors: Error of Measurement, Item Response Theory, International Assessment, Achievement Tests

A Comparison of Linking Methods for Estimating National Trends in International Comparative Large-Scale Assessments in the Presence of Cross-national DIF

Peer reviewed

Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016

Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…

Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation

Characterizing Sources of Uncertainty in Item Response Theory Scale Scores

Peer reviewed

Yang, Ji Seung; Hansen, Mark; Cai, Li – Educational and Psychological Measurement, 2012

Traditional estimators of item response theory scale scores ignore uncertainty carried over from the item calibration process, which can lead to incorrect estimates of the standard errors of measurement (SEMs). Here, the authors review a variety of approaches that have been applied to this problem and compare them on the basis of their statistical…

Descriptors: Item Response Theory, Scores, Statistical Analysis, Comparative Analysis

Comparing OECD PISA Reading in English to Other Languages: Identifying Potential Sources of Non-Invariance

Peer reviewed

Asil, Mustafa; Brown, Gavin T. L. – International Journal of Testing, 2016

The use of the Programme for International Student Assessment (PISA) across nations, cultures, and languages has been criticized. The key criticisms point to the linguistic and cultural biases potentially underlying the design of reading comprehension tests, raising doubts about the legitimacy of comparisons across economies. Our research focused…

Descriptors: Comparative Analysis, Reading Achievement, Achievement Tests, Secondary School Students
