Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 9 |
| Since 2007 (last 20 years) | 38 |
Descriptor
| Comparative Analysis | 58 |
| Test Items | 58 |
| Data Analysis | 33 |
| Foreign Countries | 18 |
| Scores | 14 |
| Item Analysis | 13 |
| Achievement Tests | 12 |
| Test Construction | 12 |
| Mathematics Tests | 11 |
| Tables (Data) | 11 |
| Academic Achievement | 10 |
| More ▼ | |
Source
Author
| Donovan, Jenny | 3 |
| Lennon, Melissa | 3 |
| Sinharay, Sandip | 3 |
| Hutton, Penny | 2 |
| Morrissey, Noni | 2 |
| O'Connor, Gayl | 2 |
| Reckase, Mark D. | 2 |
| von Davier, Matthias | 2 |
| Aktas, Elif | 1 |
| Arora, Alka, Ed. | 1 |
| Baldwin, Peter | 1 |
| More ▼ | |
Publication Type
Education Level
| Elementary Secondary Education | 13 |
| Elementary Education | 7 |
| Secondary Education | 6 |
| Grade 8 | 4 |
| High Schools | 4 |
| Higher Education | 4 |
| Postsecondary Education | 4 |
| Grade 4 | 3 |
| Grade 6 | 3 |
| Intermediate Grades | 2 |
| Grade 12 | 1 |
| More ▼ | |
Audience
| Researchers | 1 |
Location
| Australia | 6 |
| United Kingdom (England) | 2 |
| California | 1 |
| Canada | 1 |
| Czech Republic | 1 |
| Ohio | 1 |
| Oregon | 1 |
| South Africa | 1 |
| Turkey | 1 |
| United Kingdom | 1 |
| United States | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Yixi Wang – ProQuest LLC, 2020
Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…
Descriptors: Item Response Theory, Educational Testing, Data, Models
Sinharay, Sandip – Journal of Educational Measurement, 2017
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Descriptors: Goodness of Fit, Testing, Test Items, Scores
Kadijevich, Djordje M. – British Journal of Educational Technology, 2015
Because the relationship between computer use and achievement is still puzzling, there is a need to prepare and analyze good quality datasets on computer use and achievement. Such a dataset can be derived from TIMSS data. This paper describes how this dataset can be prepared. It also gives an example of how the dataset may be analyzed. The…
Descriptors: Mathematics Achievement, Computer Use, Test Items, Problem Sets
Fife, James H.; James, Kofi; Peters, Stephanie – ETS Research Report Series, 2020
The concept of variability is central to statistics. In this research report, we review mathematics education research on variability and, based on that review and on feedback from an expert panel, propose a learning progression (LP) for variability. The structure of the proposed LP consists of 5 levels of sophistication in understanding…
Descriptors: Mathematics Education, Statistics Education, Feedback (Response), Research Reports
Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015
This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…
Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Zhu, Mengxiao; Shu, Zhan; von Davier, Alina A. – Journal of Educational Measurement, 2016
New technology enables interactive and adaptive scenario-based tasks (SBTs) to be adopted in educational measurement. At the same time, it is a challenging problem to build appropriate psychometric models to analyze data collected from these tasks, due to the complexity of the data. This study focuses on process data collected from SBTs. We…
Descriptors: Measurement, Data Collection, National Competency Tests, Scoring Rubrics
Uzuner Yurt, Serap; Aktas, Elif – Educational Research and Reviews, 2016
In this study, the effects of the use of peer tutoring in Effective and Good Speech Course on students' success, perception of speech self-efficacy and speaking skills were examined. The study, designed as a mixed pattern in which quantitative and qualitative research approaches were combined, was carried out together with 57 students in 2014 to…
Descriptors: Peer Teaching, Tutoring, Higher Education, College Students
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Singer, Judith D., Ed.; Braun, Henry I., Ed.; Chudowsky, Naomi, Ed. – National Academy of Education, 2018
Results from international large-scale assessments (ILSAs) garner considerable attention in the media, academia, and among policy makers. Although there is widespread recognition that ILSAs can provide useful information, there is debate about what types of comparisons are the most meaningful and what could be done to assure more sound…
Descriptors: International Education, Educational Assessment, Educational Policy, Data Interpretation
Pelánek, Radek; Rihák, Ji?rí – International Educational Data Mining Society, 2016
In online educational systems we can easily collect and analyze extensive data about student learning. Current practice, however, focuses only on some aspects of these data, particularly on correctness of students answers. When a student answers incorrectly, the submitted wrong answer can give us valuable information. We provide an overview of…
Descriptors: Foreign Countries, Online Systems, Geography, Anatomy
Long, Caroline; Wendt, Heike – African Journal of Research in Mathematics, Science and Technology Education, 2017
South Africa participated in TIMSS from 1995 to 2015. Over these two decades, some positive changes have been reported on the aggregated mathematics performance patterns of South African learners. This paper focuses on the achievement patterns of South Africa's high-performing Grade 9 learners (n = 3378) in comparison with similar subsamples of…
Descriptors: Foreign Countries, Comparative Analysis, Multiplication, Comparative Education
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that the respondents might give no responses to many items, resulting in less accurate estimations of both assessed abilities and item parameters. This report studies how the types of items affect the item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Peer reviewed
Direct link
