Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 8 |
| Since 2017 (last 10 years) | 13 |
| Since 2007 (last 20 years) | 18 |
Descriptor
| Sample Size | 18 |
| Foreign Countries | 17 |
| Achievement Tests | 15 |
| International Assessment | 15 |
| Secondary School Students | 14 |
| Comparative Analysis | 6 |
| Scores | 6 |
| Test Items | 6 |
| Statistical Analysis | 5 |
| Classification | 4 |
| Correlation | 4 |
| More ▼ | |
Source
Author
| Abulela, Mohammed A. A. | 1 |
| Adesope, Olusola | 1 |
| Aksu, Gökhan | 1 |
| Ana Balcão Reis | 1 |
| Andreas Frey | 1 |
| Austin, Bruce | 1 |
| Bolin, Jocelyn H. | 1 |
| Borgonovi, Francesca | 1 |
| Christoph König | 1 |
| Cohen, Allan S. | 1 |
| Eser, Mehmet Taha | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 17 |
| Reports - Research | 16 |
| Reports - Evaluative | 2 |
| Numerical/Quantitative Data | 1 |
Education Level
| Secondary Education | 15 |
| Elementary Secondary Education | 2 |
Audience
Location
| Indonesia | 2 |
| Portugal | 2 |
| Turkey | 2 |
| Australia | 1 |
| Colombia | 1 |
| Finland | 1 |
| Germany | 1 |
| Israel | 1 |
| Jordan | 1 |
| Latvia | 1 |
| Mexico | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Program for International… | 18 |
| Progress in International… | 1 |
| Trends in International… | 1 |
What Works Clearinghouse Rating
Ricardo Colaço; Pedro Freitas; Luis Catela Nunes; Ana Balcão Reis – Large-scale Assessments in Education, 2025
We analyse the PISA-reported convergence in the performance of private and public schools in Portugal. When PISA sampling weights are used, the number of students enrolled in those types of schools and specific grades/tracks of study differs significantly from official population figures. To account for those differences, we apply a…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025
The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…
Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement
Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025
Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…
Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction
Mangino, Anthony A.; Bolin, Jocelyn H.; Finch, W. Holmes – Educational and Psychological Measurement, 2023
This study seeks to compare fixed and mixed effects models for the purposes of predictive classification in the presence of multilevel data. The first part of the study utilizes a Monte Carlo simulation to compare fixed and mixed effects logistic regression and random forests. An applied examination of the prediction of student retention in the…
Descriptors: Prediction, Classification, Monte Carlo Methods, Foreign Countries
Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025
This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…
Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment
Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023
We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…
Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions
Han, Areum; Krieger, Florian; Borgonovi, Francesca; Greiff, Samuel – Large-scale Assessments in Education, 2023
Process data are becoming more and more popular in education research. In the field of computer-based assessments of collaborative problem solving (ColPS), process data have been used to identify students' test-taking strategies while working on the assessment, and such data can be used to complement data collected on accuracy and overall…
Descriptors: Behavior Patterns, Cooperative Learning, Problem Solving, Reaction Time
König, Christoph; Khorramdel, Lale; Yamamoto, Kentaro; Frey, Andreas – Educational Measurement: Issues and Practice, 2021
Large-scale assessments such as the Programme for International Student Assessment (PISA) have field trials where new survey features are tested for utility in the main survey. Because of resource constraints, there is a trade-off between how much of the sample can be used to test new survey features and how much can be used for the initial item…
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment
Aksu, Gökhan; Güzeller, Cem Oktay; Eser, Mehmet Taha – International Journal of Assessment Tools in Education, 2019
In this study, it was aimed to compare different normalization methods employed in model developing process via artificial neural networks with different sample sizes. As part of comparison of normalization methods, input variables were set as: work discipline, environmental awareness, instrumental motivation, science self-efficacy, and weekly…
Descriptors: Sample Size, Artificial Intelligence, Classification, Statistical Analysis
Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020
Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…
Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Toprak, Emre; Gelbal, Selahattin – International Journal of Assessment Tools in Education, 2020
This study aims to compare the performances of the artificial neural network, decision trees and discriminant analysis methods to classify student achievement. The study uses multilayer perceptron model to form the artificial neural network model, chi-square automatic interaction detection (CHAID) algorithm to apply the decision trees method and…
Descriptors: Comparative Analysis, Classification, Artificial Intelligence, Networks
Austin, Bruce; French, Brian; Adesope, Olusola; Gotch, Chad – Journal of Experimental Education, 2017
Measures of variability are successfully used in predictive modeling in research areas outside of education. This study examined how standard deviations can be used to address research questions not easily addressed using traditional measures such as group means based on index variables. Student survey data were obtained from the Organisation for…
Descriptors: Predictor Variables, Models, Predictive Measurement, Statistical Analysis
Jerrim, John; Wyness, Gill – London Review of Education, 2016
The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' academic achievement. Although PISA has traditionally been used to draw comparisons across countries, there is growing interest in the production of regional (i.e. city,state, or provincial level) results. In this paper we present the…
Descriptors: Benchmarking, Foreign Countries, Achievement Tests, International Assessment
Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016
Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…
Descriptors: Educational Assessment, Coding, Automation, Responses
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
