ERIC - Search Results

Showing 1 to 15 of 23 results

Implementing a Standardized Effect Size in the POLYSIBTEST Procedure

Peer reviewed

Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023

A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…

Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis

The 3H and Spiral Dynamics Models: A Reconciliation

Peer reviewed
PDF on ERIC

Cheema, Jehanzeb Rashid – Journal of Education in Muslim Societies, 2024

This study explores the relationship between the Spiral Dynamics and the 3H (head, heart, hands) models of human growth and development, using constructs such as empathy, moral reasoning, forgiveness, and community mindedness that have been shown to have implications for education. The specific research question is, "Can a combination of…

Descriptors: Correlation, Factor Analysis, Computer Software, Moral Values

A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

Peer reviewed

Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…

Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance

An Investigation of Item Calibration Methods in Multistage Testing

Peer reviewed

Cai, Liuhan; Albano, Anthony D.; Roussos, Louis A. – Measurement: Interdisciplinary Research and Perspectives, 2021

Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item…

Descriptors: Adaptive Testing, Test Items, Item Response Theory, Test Construction

Investigation of the Effect of Parameter Estimation and Classification Accuracy in Mixture IRT Models under Different Conditions

Peer reviewed
PDF on ERIC

Saatcioglu, Fatima Munevver; Atar, Hakan Yavuz – International Journal of Assessment Tools in Education, 2022

This study aims to examine the effects of mixture item response theory (IRT) models on item parameter estimation and classification accuracy under different conditions. The manipulated variables of the simulation study are set as mixture IRT models (Rasch, 2PL, 3PL); sample size (600, 1000); the number of items (10, 30); the number of latent…

Descriptors: Accuracy, Classification, Item Response Theory, Programming Languages

Governments Harnessing the Power of Data to Get 'Value for Money': A Simulation Study of England's Office for Students B3 Proceed Metric

Peer reviewed

Bradley, Alex; Quigley, Martyn – Studies in Higher Education, 2023

The mass participation in higher education has led to greater spending by governments and students which has increased the focus on graduate outcomes. In England, the Office for Students (OfS) is planning to take regulatory action, using the Proceed metric, against universities and their courses which do not have 60% of students with positive…

Descriptors: Foreign Countries, Higher Education, Education Work Relationship, Outcomes of Education

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Applications of Small Area Estimation to Generalization with Subclassification by Propensity Scores

Peer reviewed

Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018

Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…

Descriptors: Computation, Generalization, Probability, Sample Size

Invariance Properties for General Diagnostic Classification Models

Peer reviewed

Bradshaw, Laine P.; Madison, Matthew J. – International Journal of Testing, 2016

In item response theory (IRT), the invariance property states that item parameter estimates are independent of the examinee sample, and examinee ability estimates are independent of the test items. While this property has long been established and understood by the measurement community for IRT models, the same cannot be said for diagnostic…

Descriptors: Classification, Models, Simulation, Psychometrics

Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

Peer reviewed

Suh, Youngsuk – Journal of Educational Measurement, 2016

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…

Descriptors: Effect Size, Goodness of Fit, Statistical Analysis, Statistical Significance

Bayesian Analysis and Design for Joint Modeling of Two Binary Responses with Misclassification

Peer reviewed

Stamey, James D.; Beavers, Daniel P.; Sherr, Michael E. – Sociological Methods & Research, 2017

Survey data are often subject to various types of errors such as misclassification. In this article, we consider a model where interest is simultaneously in two correlated response variables and one is potentially subject to misclassification. A motivating example of a recent study of the impact of a sexual education course for adolescents is…

Descriptors: Bayesian Statistics, Classification, Models, Correlation

Challenging Conventional Wisdom for Multivariate Statistical Models with Small Samples

Peer reviewed

McNeish, Daniel – Review of Educational Research, 2017

In education research, small samples are common because of financial limitations, logistical challenges, or exploratory studies. With small samples, statistical principles on which researchers rely do not hold, leading to trust issues with model estimates and possible replication issues when scaling up. Researchers are generally aware of such…

Descriptors: Models, Statistical Analysis, Sampling, Sample Size

Two Approaches to Estimation of Classification Accuracy Rate under Item Response Theory

Peer reviewed

Lathrop, Quinn N.; Cheng, Ying – Applied Psychological Measurement, 2013

Within the framework of item response theory (IRT), there are two recent lines of work on the estimation of classification accuracy (CA) rate. One approach estimates CA when decisions are made based on total sum scores, the other based on latent trait estimates. The former is referred to as the Lee approach, and the latter, the Rudner approach,…

Descriptors: Item Response Theory, Accuracy, Classification, Computation

An Evaluation of Information Criteria Use for Correct Cross-Classified Random Effects Model Selection

Peer reviewed

Beretvas, S. Natasha; Murphy, Daniel L. – Journal of Experimental Education, 2013

The authors assessed correct model identification rates of Akaike's information criterion (AIC), corrected criterion (AICC), consistent AIC (CAIC), Hannan and Quinn's information criterion (HQIC), and Bayesian information criterion (BIC) for selecting among cross-classified random effects models. Performance of default values for the 5…

Descriptors: Models, Goodness of Fit, Evaluation Criteria, Educational Research

Performance of the S-χ² Statistic for Full-Information Bifactor Models

Peer reviewed

Li, Ying; Rupp, Andre A. – Educational and Psychological Measurement, 2011

This study investigated the Type I error rate and power of the multivariate extension of the S-χ² statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…

Descriptors: Test Length, Item Response Theory, Statistical Analysis, Error Patterns
