ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	8
Since 2017 (last 10 years)	13
Since 2007 (last 20 years)	18

Source

Educational and Psychological…	3
International Journal of…	3
Journal of Educational…	2
Large-scale Assessments in…	2
Applied Measurement in…	1
Comparative Education Review	1
Educational Measurement:…	1
International Journal of…	1
Journal of Experimental…	1
London Review of Education	1
Measurement:…	1
National Education Policy…	1

Publication Type

Journal Articles	17
Reports - Research	16
Reports - Evaluative	2
Numerical/Quantitative Data	1

Education Level

Secondary Education	15
Elementary Secondary Education	2

Location

Indonesia	2
Portugal	2
Turkey	2
Australia	1
Colombia	1
Finland	1
Germany	1
Israel	1
Jordan	1
Latvia	1
Mexico	1
Peru	1
Qatar	1
Romania	1
Singapore	1
Spain	1
United Kingdom (London)	1

Assessments and Surveys

Program for International…	18
Progress in International…	1
Trends in International…	1

Program for International Student Assessment

Showing 1 to 15 of 18 results

Understanding the Private-Public School Performance Gap in PISA: Evidence from Portugal

Peer reviewed

Ricardo Colaço; Pedro Freitas; Luis Catela Nunes; Ana Balcão Reis – Large-scale Assessments in Education, 2025

We analyse the PISA-reported convergence in the performance of private and public schools in Portugal. When PISA sampling weights are used, the number of students enrolled in those types of schools and specific grades/tracks of study differs significantly from official population figures. To account for those differences, we apply a…

Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students

Incorporating Test-Taking Engagement into Multistage Adaptive Testing Design for Large-Scale Assessments

Peer reviewed

Okan Bulut; Guher Gorgun; Hacer Karamese – Journal of Educational Measurement, 2025

The use of multistage adaptive testing (MST) has gradually increased in large-scale testing programs as MST achieves a balanced compromise between linear test design and item-level adaptive testing. MST works on the premise that each examinee gives their best effort when attempting the items, and their responses truly reflect what they know or can…

Descriptors: Response Style (Tests), Testing Problems, Testing Accommodations, Measurement

Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs

Peer reviewed

Hyo Jeong Shin; Christoph König; Frederic Robin; Andreas Frey; Kentaro Yamamoto – Journal of Educational Measurement, 2025

Many international large-scale assessments (ILSAs) have switched to multistage adaptive testing (MST) designs to improve measurement efficiency in measuring the skills of the heterogeneous populations around the world. In this context, previous literature has reported the acceptable level of model parameter recovery under the MST designs when the…

Descriptors: Robustness (Statistics), Item Response Theory, Adaptive Testing, Test Construction

Fixed Effects or Mixed Effects Classifiers? Evidence from Simulated and Archival Data

Peer reviewed

Mangino, Anthony A.; Bolin, Jocelyn H.; Finch, W. Holmes – Educational and Psychological Measurement, 2023

This study seeks to compare fixed and mixed effects models for the purposes of predictive classification in the presence of multilevel data. The first part of the study utilizes a Monte Carlo simulation to compare fixed and mixed effects logistic regression and random forests. An applied examination of the prediction of student retention in the…

Descriptors: Prediction, Classification, Monte Carlo Methods, Foreign Countries

Examining the Impact of Violations of Local Item Independence Assumption on Test Equating Methods

Peer reviewed
PDF on ERIC

Mehmet Fatih Doguyurt; Seref Tan – International Journal of Assessment Tools in Education, 2025

This study investigates the impact of violating the local item independence assumption by loading certain items onto a second dimension on test equating errors in unidimensional and dichotomous tests. The research was designed as a simulation study, using data generated based on the PISA 2018 mathematics exam. Analyses were conducted under 36…

Descriptors: Equated Scores, Test Items, Mathematics Tests, International Assessment

Performance of Coefficient Alpha and Its Alternatives: Effects of Different Types of Non-Normality

Peer reviewed

Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023

We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…

Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions

Behavioral Patterns in Collaborative Problem Solving: A Latent Profile Analysis Based on Response Times and Actions in PISA 2015

Peer reviewed

Han, Areum; Krieger, Florian; Borgonovi, Francesca; Greiff, Samuel – Large-scale Assessments in Education, 2023

Process data are becoming more and more popular in education research. In the field of computer-based assessments of collaborative problem solving (ColPS), process data have been used to identify students' test-taking strategies while working on the assessment, and such data can be used to complement data collected on accuracy and overall…

Descriptors: Behavior Patterns, Cooperative Learning, Problem Solving, Reaction Time

The Benefits of Fixed Item Parameter Calibration for Parameter Accuracy in Small Sample Situations in Large-Scale Assessments

Peer reviewed

König, Christoph; Khorramdel, Lale; Yamamoto, Kentaro; Frey, Andreas – Educational Measurement: Issues and Practice, 2021

Large-scale assessments such as the Programme for International Student Assessment (PISA) have field trials where new survey features are tested for utility in the main survey. Because of resource constraints, there is a trade-off between how much of the sample can be used to test new survey features and how much can be used for the initial item…

Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment

The Effect of the Normalization Method Used in Different Sample Sizes on the Success of Artificial Neural Network Model

Peer reviewed
PDF on ERIC

Aksu, Gökhan; Güzeller, Cem Oktay; Eser, Mehmet Taha – International Journal of Assessment Tools in Education, 2019

In this study, it was aimed to compare different normalization methods employed in model developing process via artificial neural networks with different sample sizes. As part of comparison of normalization methods, input variables were set as: work discipline, environmental awareness, instrumental motivation, science self-efficacy, and weekly…

Descriptors: Sample Size, Artificial Intelligence, Classification, Statistical Analysis

Estimation of Mixture Rasch Models from Skewed Latent Ability Distributions

Peer reviewed

Karadavut, Tugba; Cohen, Allan S.; Kim, Seock-Ho – Measurement: Interdisciplinary Research and Perspectives, 2020

Mixture Rasch (MixRasch) models conventionally assume normal distributions for latent ability. Previous research has shown that the assumption of normality is often unmet in educational and psychological measurement. When normality is assumed, asymmetry in the actual latent ability distribution has been shown to result in extraction of spurious…

Descriptors: Item Response Theory, Ability, Statistical Distributions, Sample Size

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Comparison of Classification Performances of Mathematics Achievement at PISA 2012 with the Artificial Neural Network, Decision Trees and Discriminant Analysis

Peer reviewed
PDF on ERIC

Toprak, Emre; Gelbal, Selahattin – International Journal of Assessment Tools in Education, 2020

This study aims to compare the performances of the artificial neural network, decision trees and discriminant analysis methods to classify student achievement. The study uses multilayer perceptron model to form the artificial neural network model, chi-square automatic interaction detection (CHAID) algorithm to apply the decision trees method and…

Descriptors: Comparative Analysis, Classification, Artificial Intelligence, Networks

Use of Standard Deviations as Predictors in Models Using Large-Scale International Data Sets

Peer reviewed

Austin, Bruce; French, Brian; Adesope, Olusola; Gotch, Chad – Journal of Experimental Education, 2017

Measures of variability are successfully used in predictive modeling in research areas outside of education. This study examined how standard deviations can be used to address research questions not easily addressed using traditional measures such as group means based on index variables. Student survey data were obtained from the Organisation for…

Descriptors: Predictor Variables, Models, Predictive Measurement, Statistical Analysis

Benchmarking London in the PISA Rankings

Peer reviewed
PDF on ERIC

Jerrim, John; Wyness, Gill – London Review of Education, 2016

The Programme for International Student Assessment (PISA) is an important international study of 15-year-olds' academic achievement. Although PISA has traditionally been used to draw comparisons across countries, there is growing interest in the production of regional (i.e. city, state, or provincial level) results. In this paper we present the…

Descriptors: Benchmarking, Foreign Countries, Achievement Tests, International Assessment

Automatic Coding of Short Text Responses via Clustering in Educational Assessment

Peer reviewed

Zehner, Fabian; Sälzer, Christine; Goldhammer, Frank – Educational and Psychological Measurement, 2016

Automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the "Programme…

Descriptors: Educational Assessment, Coding, Automation, Responses

Descriptor

Sample Size	18
Foreign Countries	17
Achievement Tests	15
International Assessment	15
Secondary School Students	14
Comparative Analysis	6
Scores	6
Test Items	6
Statistical Analysis	5
Classification	4
Correlation	4
Item Response Theory	4
Models	4
Academic Achievement	3
Accuracy	3
Error of Measurement	3
Mathematics Achievement	3
Artificial Intelligence	2
Computer Assisted Testing	2
Cross Cultural Studies	2
Educational Assessment	2
Error Patterns	2
Evaluation Methods	2
Item Analysis	2
Mathematics Tests	2

Author

Abulela, Mohammed A. A.	1
Adesope, Olusola	1
Aksu, Gökhan	1
Ana Balcão Reis	1
Andreas Frey	1
Austin, Bruce	1
Bolin, Jocelyn H.	1
Borgonovi, Francesca	1
Christoph König	1
Cohen, Allan S.	1
Eser, Mehmet Taha	1
Finch, W. Holmes	1
Frederic Robin	1
French, Brian	1
Frey, Andreas	1
Gelbal, Selahattin	1
Goldhammer, Frank	1
Gotch, Chad	1
Greiff, Samuel	1
Guher Gorgun	1
Güzeller, Cem Oktay	1
Hacer Karamese	1
Han, Areum	1
Hau, Kit-Tai	1
Hyo Jeong Shin	1