Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 31 |
Descriptor
Computation | 31 |
Models | 31 |
Reliability | 31 |
Statistical Analysis | 9 |
Correlation | 7 |
Item Response Theory | 7 |
Comparative Analysis | 6 |
Error of Measurement | 6 |
Scores | 6 |
Validity | 6 |
Accuracy | 5 |
Author
Haberman, Shelby J. | 3 |
Raykov, Tenko | 3 |
Dimitrov, Dimiter M. | 2 |
Almond, Russell G. | 1 |
Alonso, Ariel | 1 |
Asparouhov, Tihomir | 1 |
Botella, Juan | 1 |
Boucher, Jean P. | 1 |
Brennan, Robert L. | 1 |
Capuano, Nicola | 1 |
Chen, Jianwei | 1 |
Publication Type
Journal Articles | 27 |
Reports - Research | 15 |
Reports - Descriptive | 10 |
Reports - Evaluative | 3 |
Dissertations/Theses -… | 2 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Secondary Education | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
Elementary Education | 1 |
Grade 4 | 1 |
Audience
Researchers | 1 |
Assessments and Surveys
Eysenck Personality Inventory | 1 |
Praxis Series | 1 |
Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023
Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…
Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification
DeMars, Christine E. – Applied Measurement in Education, 2021
Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…
Descriptors: Item Response Theory, Test Items, Ability, Scores
The Reliability of the Posterior Probability of Skill Attainment in Diagnostic Classification Models
Johnson, Matthew S.; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2020
One common score reported from diagnostic classification assessments is the vector of posterior means of the skill mastery indicators. As with any assessment, it is important to derive and report estimates of the reliability of the reported scores. After reviewing a reliability measure suggested by Templin and Bradshaw, this article suggests three…
Descriptors: Reliability, Probability, Skill Development, Classification
Nicewander, W. Alan – Educational and Psychological Measurement, 2018
Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…
Descriptors: Error of Measurement, Correlation, Sample Size, Computation
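For orientation, the classical form of Spearman's correction divides the observed correlation by the square root of the product of the two reliabilities. The sketch below illustrates only that textbook formula, not the article's own derivations; the function name and the example values are hypothetical.

```python
import math


def disattenuate(r_xy, rel_x, rel_y):
    """Classical Spearman correction: r_xy / sqrt(rel_x * rel_y).

    r_xy  : observed correlation between the two measures
    rel_x : reliability of measure X (assumed to lie in (0, 1])
    rel_y : reliability of measure Y (assumed to lie in (0, 1])
    """
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Hypothetical values, chosen only to show the call.
corrected = disattenuate(0.42, 0.70, 0.80)
print(round(corrected, 3))
```

With sample estimates the corrected value can exceed 1, so the result is best read as an estimate rather than a bounded correlation.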
Qian, Minghui; Hu, Ridong; Chen, Jianwei – EURASIA Journal of Mathematics, Science & Technology Education, 2016
Spatial panel data models have been widely studied and applied in both scientific and social science disciplines, especially in the analysis of spatial influence. In this paper, we consider the spatial dynamic nonparametric Durbin model (SDNDM) with fixed effects, which takes the nonlinear factors into account based on the spatial dynamic panel…
Descriptors: Nonparametric Statistics, Models, Hypothesis Testing, Statistical Analysis
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
Descriptors: Computation, Statistical Analysis, Reliability, Models
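As a point of reference, the familiar sample formula for coefficient alpha is alpha = k/(k-1) * (1 - sum of component variances / variance of the total score). The sketch below computes that point estimate only. It is not the latent variable modeling procedure the abstract describes and it gives no interval estimate; the function name and data layout are assumptions made for illustration.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Sample coefficient alpha from a respondents-by-components table.

    scores: list of rows, one per respondent, each row holding the
            k component scores for that respondent (assumed layout).
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 components and 2 respondents")
    # Sample variance of each component across respondents.
    component_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the total (sum) score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(component_vars) / total_var)


# Hypothetical data: 4 respondents, 3 components.
data = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [3, 5, 4]]
print(cronbach_alpha(data))
```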
Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014
As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's α, Feldt-Raju, stratified α, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…
Descriptors: Item Response Theory, Reliability, Models, Computation
Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017
Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full extent, they require the resolution of new issues like assessing students at scale. A feasible approach to tackle this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…
Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses
López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013
Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…
Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)
Raykov, Tenko; Dimitrov, Dimiter M.; Asparouhov, Tihomir – Structural Equation Modeling: A Multidisciplinary Journal, 2010
A method for interval estimation of scale reliability with discrete data is outlined. The approach is applicable with multi-item instruments consisting of binary measures, and is developed within the latent variable modeling methodology. The procedure is useful for evaluation of consistency of single measures and of sum scores from item sets…
Descriptors: Reliability, Computation, Models, Intervals
Steedle, Jeffrey T. – Assessment & Evaluation in Higher Education, 2012
Value-added scores from tests of college learning indicate how score gains compare to those expected from students of similar entering academic ability. Unfortunately, the choice of value-added model can impact results, and this makes it difficult to determine which results to trust. The research presented here demonstrates how value-added models…
Descriptors: College Outcomes Assessment, Postsecondary Education, Achievement Tests, Models
Culpepper, Steven Andrew – Applied Psychological Measurement, 2012
Measurement error significantly biases interaction effects and distorts researchers' inferences regarding interactive hypotheses. This article focuses on the single-indicator case and shows how to accurately estimate group slope differences by disattenuating interaction effects with errors-in-variables (EIV) regression. New analytic findings were…
Descriptors: Evidence, Test Length, Interaction, Regression (Statistics)
Zhou, Hong; Muellerleile, Paige; Ingram, Debra; Wong, Seok P. – Journal of Educational and Behavioral Statistics, 2011
Intraclass correlation coefficients (ICCs) are commonly used in behavioral measurement and psychometrics when a researcher is interested in the relationship among variables of a common class. The formulas for deriving ICCs, or generalizability coefficients, vary depending on which models are specified. This article gives the equations for…
Descriptors: Computation, Statistical Analysis, Generalizability Theory, Correlation
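To make the model dependence concrete, here is a sketch of one common case: the single-rating ICC under a one-way random effects model, (MSB - MSW) / (MSB + (k - 1) * MSW). The article's own equations are not reproduced in the abstract; this is the standard one-way formula, and the function name and data layout are assumptions.

```python
def icc_oneway(ratings):
    """One-way random effects ICC for a single rating.

    ratings: list of rows, one per subject, each row holding the
             k ratings of that subject (balanced design assumed).
    """
    n = len(ratings)
    k = len(ratings[0])
    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    # Between-subject mean square.
    msb = k * sum((m - grand_mean) ** 2 for m in row_means) / (n - 1)
    # Within-subject mean square.
    msw = sum(
        (x - m) ** 2 for row, m in zip(ratings, row_means) for x in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical data: 4 subjects, 3 ratings each.
print(icc_oneway([[7, 8, 7], [4, 5, 5], [9, 9, 8], [3, 4, 2]]))
```

Two-way models separate rater variance from error, so they give different values for the same data.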
Christie, A.; Kamen, G.; Boucher, Jean P.; Inglis, J. Greig; Gabriel, David A. – Measurement in Physical Education and Exercise Science, 2010
The Hoffmann reflex is obtained through surface electromyographic recordings, and it is one of the most common neurophysiological techniques in exercise science. Measurement and evaluation of the peak-to-peak amplitude of the Hoffmann reflex has been guided by the observation that it is a variable response that requires multiple trials to obtain a…
Descriptors: Motor Reactions, Measurement Techniques, Comparative Analysis, Statistical Analysis
Dimitrov, Dimiter M. – Mid-Western Educational Researcher, 2010
The focus of this presidential address is on the contemporary treatment of reliability and validity in educational assessment. Highlights on reliability are provided under the classical true-score model using tools from latent trait modeling to clarify important assumptions and procedures for reliability estimation. In addition to reliability,…
Descriptors: Educational Assessment, Validity, Item Response Theory, Reliability