Wolkowitz, Amanda A. – Journal of Educational Measurement, 2021
Decision consistency (DC) is the reliability of a classification decision based on a test score. In professional credentialing, the decision is often a high-stakes pass/fail decision. The current methods for estimating DC are computationally complex. The purpose of this research is to provide a computationally and conceptually simple method for…
Descriptors: Decision Making, Reliability, Classification, Scores
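The decision-consistency (DC) concept in this abstract can be illustrated with a minimal simulation sketch: two parallel forms measure the same true score with independent error, and DC is the proportion of examinees who receive the same pass/fail decision on both. All parameter values below are illustrative assumptions, not the paper's method (which is behind the snippet's cut-off).

```python
import random

def simulate_decision_consistency(n_examinees=20000, reliability=0.85,
                                  cut=0.0, seed=1):
    """Estimate decision consistency (DC) by simulating two parallel
    test forms.  DC is the proportion of examinees classified the same
    way (pass/fail at `cut`) on both forms.

    Hypothetical setup: true scores ~ N(0, 1); each observed score adds
    independent normal error sized so that score reliability equals
    `reliability` (error variance = (1 - rel) / rel times true variance).
    """
    rng = random.Random(seed)
    err_sd = ((1 - reliability) / reliability) ** 0.5
    consistent = 0
    for _ in range(n_examinees):
        true = rng.gauss(0, 1)
        form_a = true + rng.gauss(0, err_sd)
        form_b = true + rng.gauss(0, err_sd)
        consistent += (form_a >= cut) == (form_b >= cut)
    return consistent / n_examinees

dc = simulate_decision_consistency()
```

With a cut at the true-score median and reliability 0.85, the simulated DC lands near the bivariate-normal value 0.5 + arcsin(0.85)/π ≈ 0.82, which is a useful sanity check on the simulation.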
Ferrando, Pere J.; Lorenzo-Seva, Urbano – Educational and Psychological Measurement, 2019
Measures initially designed to be single-trait often yield data that are compatible with both an essentially unidimensional factor-analysis (FA) solution and a correlated-factors solution. For these cases, this article proposes an approach aimed at providing information for deciding which of the two solutions is the most appropriate and useful.…
Descriptors: Factor Analysis, Computation, Reliability, Goodness of Fit
Diao, Hongyu; Sireci, Stephen G. – Journal of Applied Testing Technology, 2018
Whenever classification decisions are made on educational tests, such as pass/fail, or basic, proficient, or advanced, the consistency and accuracy of those decisions should be estimated and reported. Methods for estimating the reliability of classification decisions made on the basis of educational tests are well-established (e.g., Rudner, 2001;…
Descriptors: Classification, Item Response Theory, Accuracy, Reliability
Komperda, Regis; Pentecost, Thomas C.; Barbera, Jack – Journal of Chemical Education, 2018
This methodological paper examines current conceptions of reliability in chemistry education research (CER) and provides recommendations for moving beyond the current reliance on reporting coefficient alpha (α) as reliability evidence without regard to its appropriateness for the research context. To help foster a better understanding of…
Descriptors: Chemistry, Science Instruction, Teaching Methods, Reliability
Lathrop, Quinn N. – Practical Assessment, Research & Evaluation, 2015
There are two main lines of research in estimating classification accuracy (CA) and classification consistency (CC) under Item Response Theory (IRT). The R package cacIRT provides computer implementations of both approaches in an accessible and unified framework. Even with available implementations, there remain decisions a researcher faces when…
Descriptors: Classification, Accuracy, Item Response Theory, Reliability
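The IRT-based classification consistency that cacIRT computes can be sketched from first principles: under a Rasch model, the Lord-Wingersky recursion gives each examinee's raw-score distribution, and CC is the chance two independent replications land on the same side of the cut, averaged over a N(0, 1) ability distribution. This is a rough Lee-style sketch in Python, not the cacIRT implementation; the item difficulties and cut score are made up for illustration.

```python
import math

def rasch_p(theta, b):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def score_distribution(theta, bs):
    """Lord-Wingersky recursion: distribution of the raw score at theta."""
    dist = [1.0]
    for b in bs:
        p = rasch_p(theta, b)
        new = [0.0] * (len(dist) + 1)
        for s, pr in enumerate(dist):
            new[s] += pr * (1 - p)      # item answered incorrectly
            new[s + 1] += pr * p        # item answered correctly
        dist = new
    return dist

def classification_consistency(bs, cut_score, n_quad=41):
    """Marginal CC: at each quadrature theta, the probability that two
    independent replications fall on the same side of the raw-score cut,
    weighted by a N(0, 1) ability density."""
    total_w, cc = 0.0, 0.0
    for i in range(n_quad):
        theta = -4 + 8 * i / (n_quad - 1)
        w = math.exp(-theta * theta / 2)
        p_pass = sum(score_distribution(theta, bs)[cut_score:])
        cc += w * (p_pass ** 2 + (1 - p_pass) ** 2)
        total_w += w
    return cc / total_w

item_difficulties = [-1.5, -0.5, 0.0, 0.5, 1.5]   # hypothetical Rasch b's
cc = classification_consistency(item_difficulties, cut_score=3)
```

Because CC is a sum of squared category probabilities, it is bounded below by 0.5 for a two-category (pass/fail) decision; values near 1 indicate a cut placed where few examinees sit.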
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A direct approach to point and interval estimation of Cronbach's coefficient alpha for multiple component measuring instruments is outlined. The procedure is based on a latent variable modeling application with widely circulated software. As a by-product, using sample data the method permits ascertaining whether the population discrepancy…
Descriptors: Computation, Statistical Analysis, Reliability, Models
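Raykov and Marcoulides work through latent variable modeling software, but the underlying target, a point and interval estimate for coefficient alpha, can be illustrated with the classical formula plus a percentile bootstrap over persons. This is a stand-in sketch, not the paper's latent-variable procedure; the simulated five-item data are invented for the example.

```python
import random
import statistics

def cronbach_alpha(items):
    """Cronbach's alpha for a list of item-score columns:
    alpha = k/(k-1) * (1 - sum of item variances / variance of total)."""
    k = len(items)
    n = len(items[0])
    totals = [sum(col[i] for col in items) for i in range(n)]
    item_var = sum(statistics.variance(col) for col in items)
    return k / (k - 1) * (1 - item_var / statistics.variance(totals))

def bootstrap_ci(items, n_boot=500, level=0.95, seed=7):
    """Percentile bootstrap interval for alpha, resampling persons."""
    rng = random.Random(seed)
    n = len(items[0])
    stats = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(cronbach_alpha([[col[i] for i in idx] for col in items]))
    stats.sort()
    lo = stats[int((1 - level) / 2 * n_boot)]
    hi = stats[int((1 + level) / 2 * n_boot) - 1]
    return lo, hi

# Hypothetical data: 300 persons, 5 congeneric-looking items
rng = random.Random(0)
true_scores = [rng.gauss(0, 1) for _ in range(300)]
items = [[t + rng.gauss(0, 0.8) for t in true_scores] for _ in range(5)]
alpha = cronbach_alpha(items)
ci_lo, ci_hi = bootstrap_ci(items)
```

For these generating values the population alpha is about 0.89, so the point estimate and interval should bracket that neighborhood; the latent variable approach in the article additionally tests whether alpha's assumptions hold.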
Leckie, George – Journal of Educational and Behavioral Statistics, 2018
The traditional approach to estimating the consistency of school effects across subject areas and the stability of school effects across time is to fit separate value-added multilevel models to each subject or cohort and to correlate the resulting empirical Bayes predictions. We show that this gives biased correlations and these biases cannot be…
Descriptors: Value Added Models, Reliability, Statistical Bias, Computation
Gorard, Stephen; Gorard, Jonathan – International Journal of Social Research Methodology, 2016
This brief paper introduces a new approach to assessing the trustworthiness of research comparisons when expressed numerically. The 'number needed to disturb' a research finding would be the number of counterfactual values that can be added to the smallest arm of any comparison before the difference or 'effect' size disappears, minus the number of…
Descriptors: Statistical Significance, Testing, Sampling, Attrition (Research Studies)
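The counting step of the 'number needed to disturb' idea can be sketched directly from the abstract's description: append counterfactual values to the smaller arm until the effect size no longer holds up, and count how many were needed. Both the counterfactual value used here (the opposing arm's mean) and the "disappears" threshold are illustrative assumptions, and the full measure subtracts a further count described after the snippet's cut-off, which is left out.

```python
import statistics

def cohens_d(a, b):
    """Pooled-SD standardized mean difference."""
    na, nb = len(a), len(b)
    pooled = (((na - 1) * statistics.variance(a)
               + (nb - 1) * statistics.variance(b)) / (na + nb - 2)) ** 0.5
    return (statistics.mean(a) - statistics.mean(b)) / pooled

def nntd_count(treatment, control, threshold=0.1, cap=50000):
    """Counting step only: append counterfactual values (here, the
    opposing arm's mean -- an illustrative choice) to the smaller arm,
    one at a time, until the effect size drops below `threshold`."""
    t, c = list(treatment), list(control)
    small = t if len(t) <= len(c) else c
    counterfactual = statistics.mean(c if small is t else t)
    added = 0
    while cohens_d(t, c) > threshold and added < cap:
        small.append(counterfactual)
        added += 1
    return added

# Made-up scores: a small positive difference favoring "treatment"
treatment = [0.3, 0.4, 0.45, 0.5, 0.5, 0.55, 0.6, 0.6, 0.7, 0.8]
control = [-0.2, -0.1, -0.1, 0.0, 0.0, 0.0, 0.05, 0.1, 0.1, 0.2, 0.25, 0.3]
nntd = nntd_count(treatment, control)
```

A large count means many contrary cases would be needed to overturn the comparison, which is the trustworthiness reading the paper proposes.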
Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017
Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full potential, they require solutions to new challenges such as assessing students at scale. A feasible approach to this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…
Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses
Beaujean, A. Alexander – Practical Assessment, Research & Evaluation, 2014
A common question asked by researchers using regression models is, What sample size is needed for my study? While there are formulae to estimate sample sizes, their assumptions are often not met in the collected data. A more realistic approach to sample size determination requires more information such as the model of interest, strength of the…
Descriptors: Regression (Statistics), Sample Size, Sampling, Monte Carlo Methods
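The simulation-based approach to sample size determination that Beaujean describes can be sketched as a Monte Carlo power study: posit the model and effect of interest, simulate data at candidate sample sizes, refit, and take the smallest n whose simulated power meets the target. The slope size, error variance, and candidate grid below are assumptions for illustration; the normal critical value is used in place of the t value, which is reasonable at these sample sizes.

```python
import random
import statistics
from statistics import NormalDist

def slope_power(n, beta=0.3, n_sims=400, alpha=0.05, seed=3):
    """Monte Carlo power for detecting a slope of size `beta` in simple
    linear regression with x ~ N(0, 1) and unit-variance errors."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    hits = 0
    for _ in range(n_sims):
        x = [rng.gauss(0, 1) for _ in range(n)]
        y = [beta * xi + rng.gauss(0, 1) for xi in x]
        mx, my = statistics.mean(x), statistics.mean(y)
        sxx = sum((xi - mx) ** 2 for xi in x)
        sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
        b = sxy / sxx                                  # OLS slope
        resid = [yi - my - b * (xi - mx) for xi, yi in zip(x, y)]
        s2 = sum(r * r for r in resid) / (n - 2)       # residual variance
        se = (s2 / sxx) ** 0.5                         # slope standard error
        hits += abs(b / se) > z_crit
    return hits / n_sims

def required_n(target_power=0.8, candidates=(50, 100, 150, 200)):
    """Smallest candidate n whose simulated power meets the target."""
    for n in candidates:
        if slope_power(n) >= target_power:
            return n
    return None

n_needed = required_n()
```

For a standardized slope of 0.3, the analytic approximation puts the requirement near n ≈ 90, so the grid search should settle on 100; refining the grid trades computation for precision.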
Caudle, Kyle A.; Ruth, David M. – Journal of Computers in Mathematics and Science Teaching, 2013
Teaching undergraduates the basic properties of an estimator can be difficult. Most definitions are easy enough to comprehend, but difficulties often lie in gaining a "good feel" for these properties and why one property might be more desired as compared to another property. Simulations which involve visualization of these properties can…
Descriptors: Computation, Statistics, College Mathematics, Mathematics Instruction
Culpepper, Steven Andrew – Applied Psychological Measurement, 2013
A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement
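The core phenomenon Culpepper analyzes, that coarser response scales attenuate score reliability, can be shown with a small simulation: coarsen each item's continuous response into k equal-width bins and compare the test-retest correlation of the total score across k. This discretization scheme is an illustrative stand-in, not the paper's IRT-based derivation.

```python
import random
import statistics

def pearson(x, y):
    """Pearson correlation of two equal-length lists."""
    mx, my = statistics.mean(x), statistics.mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def categorized_reliability(n_categories, n_persons=4000, n_items=10, seed=5):
    """Test-retest correlation of a total score when each item's
    continuous response (true score + unit-variance noise) is coarsened
    into `n_categories` equal-width bins over [-3, 3]."""
    rng = random.Random(seed)
    width = 6.0 / n_categories
    def categorize(v):
        v = max(-3.0, min(3.0 - 1e-9, v))
        return int((v + 3.0) // width)
    totals_1, totals_2 = [], []
    for _ in range(n_persons):
        true = rng.gauss(0, 1)
        totals_1.append(sum(categorize(true + rng.gauss(0, 1))
                            for _ in range(n_items)))
        totals_2.append(sum(categorize(true + rng.gauss(0, 1))
                            for _ in range(n_items)))
    return pearson(totals_1, totals_2)

rel_2 = categorized_reliability(2)   # dichotomous items
rel_7 = categorized_reliability(7)   # 7-point items
```

Dichotomizing should cost several points of reliability relative to a 7-point scale here, consistent with the classical result that most of the loss occurs below about five categories.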
Gadermann, Anne M.; Guhn, Martin; Zumbo, Bruno D. – Practical Assessment, Research & Evaluation, 2012
This paper provides a conceptual, empirical, and practical guide for estimating ordinal reliability coefficients for ordinal item response data (also referred to as Likert, Likert-type, ordered categorical, or rating scale item responses). Conventionally, reliability coefficients, such as Cronbach's alpha, are calculated using a Pearson…
Descriptors: Likert Scales, Rating Scales, Reliability, Computation
Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2012
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Descriptors: Predictive Validity, Reliability, Structural Equation Models, Measures (Individuals)
Crawford, John R.; Garthwaite, Paul H.; Morrice, Nicola; Duff, Kevin – Psychological Assessment, 2012
Supplementary methods for the analysis of the Repeatable Battery for the Assessment of Neuropsychological Status are made available, including (a) quantifying the number of abnormally low Index scores and abnormally large differences exhibited by a case and accompanying this with estimates of the percentages of the normative population expected to…
Descriptors: Neurological Impairments, Cognitive Tests, Psychological Testing, Adults