Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 13 |
| Since 2017 (last 10 years) | 27 |
| Since 2007 (last 20 years) | 38 |
Descriptor
| International Assessment | 38 |
| Test Reliability | 38 |
| Foreign Countries | 34 |
| Achievement Tests | 31 |
| Test Validity | 22 |
| Secondary School Students | 19 |
| Elementary Secondary Education | 11 |
| Item Response Theory | 10 |
| Mathematics Tests | 9 |
| Test Construction | 9 |
| Mathematics Achievement | 8 |
Author
| Kelecioglu, Hülya | 2 |
| Aditomo, Anindito | 1 |
| Anjar Putro Utomo | 1 |
| Arffman, Inga | 1 |
| Arikan, Çigdem Akin | 1 |
| Balcão Reis, Ana | 1 |
| Barbara Bruno | 1 |
| Bellens, Kim | 1 |
| Björn Andersson | 1 |
| Braeken, Johan | 1 |
| Brese, Falk | 1 |
Education Level
| Secondary Education | 25 |
| Elementary Secondary Education | 11 |
| Elementary Education | 9 |
| Middle Schools | 8 |
| Junior High Schools | 6 |
| High Schools | 5 |
| Grade 4 | 4 |
| Grade 8 | 4 |
| Intermediate Grades | 4 |
| Grade 9 | 3 |
| Postsecondary Education | 3 |
Location
| Indonesia | 5 |
| Germany | 3 |
| Singapore | 3 |
| United States | 3 |
| Australia | 2 |
| Belgium | 2 |
| Finland | 2 |
| Hong Kong | 2 |
| Massachusetts | 2 |
| Norway | 2 |
| United Kingdom (England) | 2 |
Yuriko K. Sosa Paredes; Björn Andersson – Educational Assessment, Evaluation and Accountability, 2025
In international large-scale assessments, student performance comparisons across educational systems are frequently done to assess the state and development in different domains. These results often have a large impact on educational policy and on the perceptions of an educational system's performance. Early assessments, such as the First and…
Descriptors: Test Interpretation, International Assessment, Science Tests, Scores
Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement
John Jerrim; Luis Alejandro Lopez-Agudo; Oscar David Marcenaro-Gutierrez – British Journal of Educational Studies, 2024
International large-scale assessments have gained much attention since the beginning of the twenty-first century, influencing education legislation in many countries. This includes Spain, where they have been used by successive governments to justify education policy change. Unfortunately, there was a problem with the PISA 2018 reading scores for…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
R. Ramadhani; Eni Nuraeni; Widi Purwianingsih – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Numeracy literacy is a crucial skill for understanding, using, and communicating effectively with numbers, facts, and mathematical procedures in various real-world situations. The development of numeracy literacy is crucial because it is one of the essential prerequisites for life skills in the 21st century. This study aims to develop a valid…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Kohen, Zehavit; Gharra-Badran, Yasmin – Teaching Mathematics and Its Applications, 2023
Mathematics modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modeling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…
Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity
Wondimu Ahmed – Journal of Psychoeducational Assessment, 2024
This study examined the psychometric properties of the Positive and Negative Affect Schedule for Children-Short Form (PANAS-C-SF) in a diverse sample of 15-year-olds in the United States [N = 4382]. Multiple measurement models, including a one-factor model, two-factor orthogonal and oblique models, a three-factor model (PA, Fear, and Distress),…
Descriptors: Psychometrics, Affective Measures, Factor Analysis, Fear
Laila El-Hamamsy; María Zapata-Cáceres; Estefanía Martín-Barroso; Francesco Mondada; Jessica Dehler Zufferey; Barbara Bruno; Marcos Román-González – Technology, Knowledge and Learning, 2025
The introduction of computing education into curricula worldwide requires multi-year assessments to evaluate the long-term impact on learning. However, no single Computational Thinking (CT) assessment spans primary school, and no group of CT assessments provides a means of transitioning between instruments. This study therefore investigated…
Descriptors: Cognitive Tests, Computation, Thinking Skills, Test Validity
Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023
We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…
Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions
Ken Ardon – Pioneer Institute for Public Policy Research, 2024
This paper reviews overall student performance as well as the performance of student subgroups on the assessment system developed in response to the Massachusetts Education Reform Act of 1993 (MERA), the Massachusetts Comprehensive Assessment System (MCAS). Comparing students in Massachusetts to students in the rest of the United States or against…
Descriptors: Accuracy, Test Reliability, Elementary Secondary Education, Achievement Tests
Huang, Jinyan; Dong, Yaxin; Han, Chunwei; Wang, Xiaojun – Journal of Psychoeducational Assessment, 2023
Using expert reviews and item response theory (IRT), this study evaluated the language- and culture-related construct-irrelevant variance and reliability of the 2019 TIMSS sense of school belonging scale (SSBS) for grades 4 and 8. The five items of the SSBS, which were identical for both grades, were reviewed for the language- and culture-related…
Descriptors: Construct Validity, Test Reliability, Achievement Tests, Foreign Countries
Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022
Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…
Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries
Wicaksono, Azizul Ghofar Candra; Korom, Erzsébet – Participatory Educational Research, 2022
The accuracy of learning results relies on evaluation and assessment. The learning goals, including problem-solving ability, must be aligned with valid standardized measurement tools. The study on exploring the nature of problem-solving, framework, and assessment in the Indonesian context will make contributions to problem solving…
Descriptors: Problem Solving, Educational Research, Test Construction, Test Validity
Marksteiner, Tamara; Kuger, Susanne; Klieme, Eckhard – Assessment in Education: Principles, Policy & Practice, 2019
We investigate whether Anchoring Vignettes (AV) improve intercultural comparability of non-cognitive student-directed factors (e.g., procrastination). So far, correlation analyses for anchored and non-anchored scores with a criterion have been used to demonstrate the effectiveness of AV in improving data quality. However, correlation analyses are…
Descriptors: Vignettes, Equated Scores, International Assessment, Test Reliability
Gulsah Gurkan – ProQuest LLC, 2021
Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…
Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries

