Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 10 |
Descriptor
Computation | 10 |
Error of Measurement | 10 |
Item Response Theory | 6 |
Models | 4 |
Sample Size | 3 |
Accuracy | 2 |
Data Analysis | 2 |
Goodness of Fit | 2 |
Predictive Validity | 2 |
Scores | 2 |
Statistical Analysis | 2 |
More ▼ |
Source
Online Submission | 5 |
AERA Online Paper Repository | 1 |
ETS Research Report Series | 1 |
Educational Testing Service | 1 |
International Educational… | 1 |
Mid-Western Educational… | 1 |
Author
Custer, Michael | 3 |
Dimitrov, Dimiter M. | 1 |
Durand, Guillaume | 1 |
Ferron, John M. | 1 |
Frey, Sharon | 1 |
Goodwyn, Fara | 1 |
Goutte, Cyril | 1 |
Guo, Hongwen | 1 |
Herreid, Charlene | 1 |
Joo, Seang-hwane | 1 |
Kim, Jongpil | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 10 |
Reports - Research | 5 |
Reports - Evaluative | 3 |
Journal Articles | 2 |
Information Analyses | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Batelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018
Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…
Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping
Durand, Guillaume; Goutte, Cyril; Léger, Serge – International Educational Data Mining Society, 2018
Knowledge tracing is a fundamental area of educational data modeling that aims at gaining a better understanding of the learning occurring in tutoring systems. Knowledge tracing models fit various parameters on observed student performance and are evaluated through several goodness of fit metrics. Fitted parameter values are of crucial interest in…
Descriptors: Error of Measurement, Models, Goodness of Fit, Predictive Validity
Joo, Seang-hwane; Wang, Yan; Ferron, John M. – AERA Online Paper Repository, 2017
Multiple-baseline studies provide meta-analysts the opportunity to compute effect sizes based on either within-series comparisons of treatment phase to baseline phase observations, or time specific between-series comparisons of observations from those that have started treatment to observations of those that are still in baseline. The advantage of…
Descriptors: Meta Analysis, Effect Size, Hierarchical Linear Modeling, Computation
Custer, Michael – Online Submission, 2015
This study examines the relationship between sample size and item parameter estimation precision when utilizing the one-parameter model. Item parameter estimates are examined relative to "true" values by evaluating the decline in root mean squared deviation (RMSD) and the number of outliers as sample size increases. This occurs across…
Descriptors: Sample Size, Item Response Theory, Computation, Accuracy
Goodwyn, Fara – Online Submission, 2012
Location estimates calculated from heuristic data were examined using traditional and robust statistical methods. The current paper demonstrates the impact outliers have on the sample mean and proposes robust methods to control for outliers in sample data. Traditional methods fail because they rely on the statistical assumptions of normality and…
Descriptors: Robustness (Statistics), Statistical Analysis, Computation, Error of Measurement
Guo, Hongwen; Sinharay, Sandip – Educational Testing Service, 2011
Nonparametric, or kernel, estimation of item response curve (IRC) is a concern theoretically and operationally. Accuracy of this estimation, often used in item analysis in testing programs, is biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. In this study, we investigate…
Descriptors: Error of Measurement, Nonparametric Statistics, Item Response Theory, Computation
Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 2010
The focus of this presidential address is on the contemporary treatment of reliability and validity in educational assessment. Highlights on reliability are provided under the classical true-score model using tools from latent trait modeling to clarify important assumptions and procedures for reliability estimation. In addition to reliability,…
Descriptors: Educational Assessment, Validity, Item Response Theory, Reliability
Micceri, Theodore; Parasher, Pradnya; Waugh, Gordon W.; Herreid, Charlene – Online Submission, 2009
An extensive review of the research literature and a study comparing over 36,000 survey responses with archival true scores indicated that one should expect a minimum of at least three percent random error for the least ambiguous of self-report measures. The Gulliver Effect occurs when a small proportion of error in a sizable subpopulation exerts…
Descriptors: Error of Measurement, Minority Groups, Measurement, Computation
Custer, Michael; Sharairi, Sid; Yamazaki, Kenji; Signatur, Diane; Swift, David; Frey, Sharon – Online Submission, 2008
The present study compared item and ability invariance as well as model-data fit between the one-parameter (1PL) and three-parameter (3PL) Item Response Theory (IRT) models utilizing real data across five grades; second through sixth as well as simulated data at second, fourth and sixth grade. At each grade, the 1PL and 3PL IRT models were run…
Descriptors: Error of Measurement, Item Response Theory, Models, Goodness of Fit