ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	10

Source

Educational and Psychological…	2
New Mexico Public Education…	2
Advances in Physiology…	1
Applied Psychological…	1
Assessment for Effective…	1
Diagnostique	1
European Journal of…	1
IEEE Transactions on Learning…	1
Journal of Educational…	1
Measurement and Evaluation in…	1
Measurement:…	1
Practical Assessment,…	1
Psychometrika	1
Structural Equation Modeling:…	1
More ▼

Publication Type

Reports - Descriptive	16
Journal Articles	14
Numerical/Quantitative Data	2

Education Level

Elementary Secondary Education	3
Higher Education	2
Grade 4	1
Postsecondary Education	1

Audience

Researchers

Location

New Mexico	2
Germany	1
Russia	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Evaluating the Discrepancy between Scale Reliability and Cronbach's Coefficient Alpha Using Latent Variable Modeling

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Measurement: Interdisciplinary Research and Perspectives, 2023

This article outlines a readily applicable procedure for point and interval estimation of the population discrepancy between reliability and the popular Cronbach's coefficient alpha for unidimensional multi-component measuring instruments with uncorrelated errors, which are widely used in behavioral and social research. The method is developed…

Descriptors: Measurement, Test Reliability, Measurement Techniques, Error of Measurement

Calculating Conditional Reliability for Dynamic Measurement Model Capacity Estimates

Peer reviewed

Direct link

McNeish, Daniel; Dumas, Denis – Journal of Educational Measurement, 2018

Dynamic measurement modeling (DMM) is a recent framework for measuring developing constructs whose manifestation occurs after an assessment is administered (e.g., learning capacity). Empirical studies have suggested that DMM may improve consequential validity of test scores because DMM learning capacity estimates were shown to be much less related…

Descriptors: Measurement Techniques, Test Reliability, Accuracy, Computation

Generalizability Theory in R

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Lucht, Marissa – Practical Assessment, Research & Evaluation, 2019

Generalizability theory is a modern, powerful, and broad framework used to assess the reliability, or dependability, of measurements. While there exist classic works that explain the basic concepts and mathematical foundations of the method, there is currently a lack of resources addressing computational resources for those researchers wishing to…

Descriptors: Generalizability Theory, Test Reliability, Computer Software, Statistical Analysis

Reliability of Scales with Second-Order Structure: Evaluation of Coefficient Alpha's Population Slippage Using Latent Variable Modeling

Peer reviewed

Direct link

Raykov, Tenko; Goldammer, Philippe; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018

A readily applicable procedure is discussed that allows evaluation of the discrepancy between the popular coefficient alpha and the reliability coefficient of a scale with second-order factorial structure that is frequently of relevance in empirical educational and psychological research. The approach is developed within the framework of the…

Descriptors: Test Reliability, Factor Structure, Statistical Analysis, Computation

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Download full text

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

Making Do with What We Have: Use Your Bootstraps

Peer reviewed

Direct link

Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012

A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…

Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods

A Clarification of the Effects of Rapid Guessing on Coefficient [Alpha]: A Note on Attali's "Reliability of Speeded Number-Right Multiple-Choice Tests"

Peer reviewed

Direct link

Wise, Steven L.; DeMars, Christine E. – Applied Psychological Measurement, 2009

Attali (2005) recently demonstrated that Cronbach's coefficient [alpha] estimate of reliability for number-right multiple-choice tests will tend to be deflated by speededness, rather than inflated as is commonly believed and taught. Although the methods, findings, and conclusions of Attali (2005) are correct, his article may inadvertently invite a…

Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Computation

Estimation of Reliability for Multiple-Component Measuring Instruments in Hierarchical Designs

Peer reviewed

Direct link

Raykov, Tenko; du Toit, Stephen H. C. – Structural Equation Modeling: A Multidisciplinary Journal, 2005

A method for estimation of reliability for multiple-component measuring instruments with clustered data is outlined. The approach is applicable with hierarchical designs where individuals are nested within higher order units and exhibit possibly related performance on components of a scale of interest. The procedure is developed within the…

Descriptors: Structural Equation Models, Computation, Measurement Techniques, Test Reliability

Two Prophecy Formulas for Assessing the Reliability of Item Response Theory-Based Ability Estimates

Peer reviewed

Direct link

Raju, Nambury S.; Oshima, T.C. – Educational and Psychological Measurement, 2005

Two new prophecy formulas for estimating item response theory (IRT)-based reliability of a shortened or lengthened test are proposed. Some of the relationships between the two formulas, one of which is identical to the well-known Spearman-Brown prophecy formula, are examined and illustrated. The major assumptions underlying these formulas are…

Descriptors: Item Response Theory, Test Reliability, Evaluation Methods, Computation

The Order-Restricted Association Model: Two Estimation Algorithms and Issues in Testing

Peer reviewed

Direct link

Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004

This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…

Descriptors: Mathematics, Test Reliability, Computation, Testing

Curriculum-Based Measurement of Mathematics Competence: From Computation to Concepts and Applications to Real-Life Problem Solving

Peer reviewed

Direct link

Fuchs, Lynn S.; Fuchs, Douglas; Courey, Susan J. – Assessment for Effective Intervention, 2005

In this article, the authors explain how curriculum-based measurement (CBM) differs from other forms of classroom-based assessment. The development of CBM is traced from computation to concepts and applications to real-life problem solving, with examples of the assessments and illustrations of research to document technical features and utility…

Descriptors: Curriculum Based Assessment, Mathematics Skills, Case Studies, Computation

KeyMath--Revised (KMR).

Bachor, Dan G. – Diagnostique, 1990

KeyMath Revised was devised as a power test for use with students from kindergarten through grade 9. The test is divided into three dimensions: basic concepts, operations, and applications. This paper describes the test's administration, summation of data, standardization, reliability, and validity. (JDD)

Descriptors: Achievement Tests, Computation, Elementary Secondary Education, Mathematical Applications

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

Download full text

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Previous Page | Next Page »

Pages: 1 | 2

Computation	16
Test Reliability	16
Evaluation Methods	5
Test Validity	5
Error of Measurement	4
Interrater Reliability	4
Item Response Theory	4
Statistical Analysis	4
Accuracy	3
Mathematical Concepts	3
Mathematics Achievement	3
Measurement Techniques	3
Psychometrics	3
Scoring	3
Simulation	3
Student Evaluation	3
Test Construction	3
Academic Standards	2
Cutting Scores	2
English	2
Foreign Countries	2
Generalizability Theory	2
Goodness of Fit	2
Mathematical Applications	2
Mathematics Skills	2
More ▼

Raykov, Tenko	3
Marcoulides, George A.	2
Bachor, Dan G.	1
Bardhoshi, Gerta	1
Calmettes, Guillaume	1
Courey, Susan J.	1
DeMars, Christine E.	1
Drummond, Gordon B.	1
Dumas, Denis	1
Erford, Bradley T.	1
Fuchs, Douglas	1
Fuchs, Lynn S.	1
Galindo-Garre, Francisca	1
Goldammer, Philippe	1
Gorbunova, Tatiana N.	1
Griph, Gerald W.	1
Huebner, Alan	1
Li, Tatyana	1
Lucht, Marissa	1
McNeish, Daniel	1
Menold, Natalja	1
Oshima, T.C.	1
Raju, Nambury S.	1
Ueno, Maomi	1
Uto, Masaki	1
More ▼