ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	25

Descriptor

Error Patterns	32
Evaluation Methods	32
Models	32
Computation	10
Simulation	9
Classification	6
Student Evaluation	6
Comparative Analysis	5
Item Response Theory	5
Prediction	5
Scores	5
Data Analysis	4
Educational Research	4
Eye Movements	4
Feedback (Response)	4
Goodness of Fit	4
Probability	4
Problem Solving	4
Psychometrics	4
Test Items	4
Bayesian Statistics	3
Children	3
College Students	3
Data Collection	3
Equations (Mathematics)	3
More ▼

Publication Type

Journal Articles	24
Reports - Research	20
Reports - Evaluative	4
Reports - Descriptive	3
Collected Works - Proceedings	2
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1
Opinion Papers	1

Education Level

Higher Education	5
Postsecondary Education	3
Elementary Education	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Adult Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 6	1
Intermediate Grades	1
More ▼

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

Finland	1
France	1
Germany	1
Massachusetts	1
Uruguay	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Something's Different: Elaboration's Transferrable Role for False Alarm Reduction

Peer reviewed

Direct link

Lauren A. Mason; Abigail Miller; Gregory Hughes; Holly A. Taylor – Cognitive Research: Principles and Implications, 2025

False alarming, or detecting an error when there is not one, is a pervasive problem across numerous industries. The present study investigated the role of elaboration, or additional information about non-error differences in complex visual displays, for mitigating false error responding. In Experiment 1, learners studied errors and non-error…

Descriptors: Error Correction, Error Patterns, Evaluation Methods, Visual Aids

Detecting Differential Item Functioning Using Posterior Predictive Model Checking: A Comparison of Discrepancy Statistics

Peer reviewed

Direct link

Joo, Seang-Hwane; Lee, Philseok – Journal of Educational Measurement, 2022

Abstract This study proposes a new Bayesian differential item functioning (DIF) detection method using posterior predictive model checking (PPMC). Item fit measures including infit, outfit, observed score distribution (OSD), and Q1 were considered as discrepancy statistics for the PPMC DIF methods. The performance of the PPMC DIF method was…

Descriptors: Test Items, Bayesian Statistics, Monte Carlo Methods, Prediction

Improving Automated Scoring of Student Open Responses in Mathematics

Peer reviewed
PDF on ERIC

Download full text

Baral, Sami; Botelho, Anthony F.; Erickson, John A.; Benachamardi, Priyanka; Heffernan, Neil T. – International Educational Data Mining Society, 2021

Open-ended questions in mathematics are commonly used by teachers to monitor and assess students' deeper conceptual understanding of content. Student answers to these types of questions often exhibit a combination of language, drawn diagrams and tables, and mathematical formulas and expressions that supply teachers with insight into the processes…

Descriptors: Scoring, Automation, Mathematics Tests, Student Evaluation

The Effects of Incomplete Rating Designs in Combination with Rater Effects

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Journal of Educational Measurement, 2019

Researchers have explored a variety of topics related to identifying and distinguishing among specific types of rater effects, as well as the implications of different types of incomplete data collection designs for rater-mediated assessments. In this study, we used simulated data to examine the sensitivity of latent trait model indicators of…

Descriptors: Rating Scales, Models, Evaluators, Data Collection

When Labeling L2 Users as Nativelike or Not, Consider Classification Errors

Peer reviewed

Direct link

Vanhove, Jan – Second Language Research, 2020

Researchers commonly estimate the prevalence of nativelikeness among second-language learners by assessing how many of them perform similarly to a sample of native speakers on one or several linguistic tasks. Even when the native (L1) samples and second-language (L2) samples are comparable in terms of age, socio-economic status, educational…

Descriptors: Second Language Learning, Native Speakers, Labeling (of Persons), Classification

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Direct link

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

A Skew-Normal Mixture Regression Model

Peer reviewed

Direct link

Liu, Min; Lin, Tsung-I – Educational and Psychological Measurement, 2014

A challenge associated with traditional mixture regression models (MRMs), which rest on the assumption of normally distributed errors, is determining the number of unobserved groups. Specifically, even slight deviations from normality can lead to the detection of spurious classes. The current work aims to (a) examine how sensitive the commonly…

Descriptors: Regression (Statistics), Evaluation Methods, Indexes, Models

Why We (Usually) Don't Have to Worry about Multiple Comparisons

Peer reviewed

Direct link

Gelman, Andrew; Hill, Jennifer; Yajima, Masanao – Journal of Research on Educational Effectiveness, 2012

Applied researchers often find themselves making statistical inferences in settings that would seem to require multiple comparisons adjustments. We challenge the Type I error paradigm that underlies these corrections. Moreover we posit that the problem of multiple comparisons can disappear entirely when viewed from a hierarchical Bayesian…

Descriptors: Intervals, Comparative Analysis, Inferences, Error Patterns

Using Robust Standard Errors to Combine Multiple Regression Estimates with Meta-Analysis

Direct link

Williams, Ryan T. – ProQuest LLC, 2012

Combining multiple regression estimates with meta-analysis has continued to be a difficult task. A variety of methods have been proposed and used to combine multiple regression slope estimates with meta-analysis, however, most of these methods have serious methodological and practical limitations. The purpose of this study was to explore the use…

Descriptors: Multiple Regression Analysis, Meta Analysis, Evaluation Methods, Computation

Immediate Feedback and Opportunity to Revise Answers: Application of a Graded Response IRT Model

Peer reviewed

Direct link

Attali, Yigal – Applied Psychological Measurement, 2011

Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…

Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction

Assessing Algebraic Solving Ability: A Theoretical Framework

Peer reviewed
PDF on ERIC

Download full text

Lian, Lim Hooi; Yew, Wun Thiam – International Education Studies, 2012

Algebraic solving ability had been discussed by many educators and researchers. There exists no definite definition for algebraic solving ability as it can be viewed from different perspectives. In this paper, the nature of algebraic solving ability in terms of algebraic processes that demonstrate the ability in solving algebraic problem is…

Descriptors: Algebra, Mathematics Skills, Problem Solving, Evaluation Methods

The Effects of Initially Misclassified Data on the Effectiveness of Discriminant Function Analysis and Finite Mixture Modeling

Peer reviewed

Direct link

Holden, Jocelyn E.; Kelley, Ken – Educational and Psychological Measurement, 2010

Classification procedures are common and useful in behavioral, educational, social, and managerial research. Supervised classification techniques such as discriminant function analysis assume training data are perfectly classified when estimating parameters or classifying. In contrast, unsupervised classification techniques such as finite mixture…

Descriptors: Discriminant Analysis, Classification, Computation, Behavioral Science Research

Acquired Equivalence Changes Stimulus Representations

Peer reviewed
PDF on ERIC

Download full text

Direct link

Meeter, M.; Shohamy, D.; Myers, C. E. – Journal of the Experimental Analysis of Behavior, 2009

Acquired equivalence is a paradigm in which generalization is increased between two superficially dissimilar stimuli (or antecedents) that have previously been associated with similar outcomes (or consequents). Several possible mechanisms have been proposed, including changes in stimulus representations, either in the form of added associations or…

Descriptors: Stimuli, Models, Classification, Groups

Estimation of IRT Graded Response Models: Limited versus Full Information Methods

Peer reviewed

Direct link

Forero, Carlos G.; Maydeu-Olivares, Alberto – Psychological Methods, 2009

The performance of parameter estimates and standard errors in estimating F. Samejima's graded response model was examined across 324 conditions. Full information maximum likelihood (FIML) was compared with a 3-stage estimator for categorical item factor analysis (CIFA) when the unweighted least squares method was used in CIFA's third stage. CIFA…

Descriptors: Factor Analysis, Least Squares Statistics, Computation, Item Response Theory

The Performance of Multilevel Growth Curve Models under an Autoregressive Moving Average Process

Peer reviewed

Direct link

Murphy, Daniel L.; Pituch, Keenan A. – Journal of Experimental Education, 2009

The authors examined the robustness of multilevel linear growth curve modeling to misspecification of an autoregressive moving average process. As previous research has shown (J. Ferron, R. Dailey, & Q. Yi, 2002; O. Kwok, S. G. West, & S. B. Green, 2007; S. Sivo, X. Fan, & L. Witta, 2005), estimates of the fixed effects were unbiased, and Type I…

Descriptors: Sample Size, Computation, Evaluation Methods, Longitudinal Studies

Previous Page | Next Page »

Pages: 1 | 2 | 3

International Educational…	3
Journal of Educational…	3
Applied Psychological…	2
Educational and Psychological…	2
Psychological Review	2
Cognitive Psychology	1
Cognitive Research:…	1
College and Research Libraries	1
Developmental Science	1
International Education…	1
International Journal of…	1
Journal of Experimental…	1
Journal of Research on…	1
Journal of the Experimental…	1
ProQuest LLC	1
Psychological Methods	1
Psychometrika	1
Reading Research Quarterly	1
Second Language Research	1
Structural Equation Modeling:…	1
Teaching Exceptional Children	1
More ▼

Abigail Miller	1
Andrews, Dee H.	1
Attali, Yigal	1
Baral, Sami	1
Benachamardi, Priyanka	1
Botelho, Anthony F.	1
Boyer, Kristy Elizabeth, Ed.	1
Brown, Scott D.	1
Burstein, Leigh	1
Donkin, Christopher	1
Erickson, John A.	1
Forero, Carlos G.	1
Forster, Kenneth I.	1
Gable, Robert A.	1
Gelman, Andrew	1
Gregory Hughes	1
Heathcote, Andrew	1
Heffernan, Neil T.	1
Hill, Jennifer	1
Holden, Jocelyn E.	1
Holly A. Taylor	1
Hou, Likun	1
Jedidi, Kamel	1
Jones, Eli	1
More ▼