ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	11

Descriptor

Computation	13
Simulation	13
Test Reliability	13
Test Items	5
Item Response Theory	4
Scores	4
Accuracy	3
Comparative Analysis	3
Correlation	3
Evaluation Methods	3
Maximum Likelihood Statistics	3
Psychometrics	3
Testing	3
Bayesian Statistics	2
Equations (Mathematics)	2
Error of Measurement	2
Foreign Countries	2
Measurement Techniques	2
Monte Carlo Methods	2
Multivariate Analysis	2
Probability	2
Test Bias	2
Associative Learning	1
Chemistry	1
Classification	1

Source

Educational and Psychological…	5
Journal of Educational…	3
European Journal of…	1
IEEE Transactions on Learning…	1
Journal of Chemical Education	1
Journal of Educational and…	1
Psychometrika	1

Publication Type

Journal Articles	13
Reports - Research	10
Reports - Descriptive	3

Education Level

Secondary Education	2
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1

Location

Russia	1
Taiwan	1

Assessments and Surveys

Graduate Record Examinations	1
Wechsler Adult Intelligence…	1

Showing all 13 results

KR20 and KR21 for Some Nondichotomous Data (It's Not Just Cronbach's Alpha)

Peer reviewed

Foster, Robert C. – Educational and Psychological Measurement, 2021

This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…

Descriptors: Test Reliability, Data, Computation, Mathematical Formulas
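
For orientation, the classical dichotomous forms of the two estimators named in this entry can be sketched as follows. This is a minimal illustration of the textbook KR-20 and KR-21 formulas only, not the nondichotomous generalization the article presents. The function names and the sample data are hypothetical, and the population variance (divide by n) is assumed for the total scores.

```python
from statistics import mean, pvariance


def kr20(scores):
    """Classical KR-20 for rows of 0/1 item responses (one row per examinee)."""
    k = len(scores[0])                      # number of items
    totals = [sum(row) for row in scores]   # total score per examinee
    var_total = pvariance(totals)           # population variance (assumption)
    # Sum of item variances p * (1 - p), where p is the proportion correct.
    sum_pq = 0.0
    for j in range(k):
        p = mean(row[j] for row in scores)
        sum_pq += p * (1 - p)
    return k / (k - 1) * (1 - sum_pq / var_total)


def kr21(scores):
    """Classical KR-21: uses only the mean and variance of the total scores."""
    k = len(scores[0])
    totals = [sum(row) for row in scores]
    m = mean(totals)
    var_total = pvariance(totals)
    return k / (k - 1) * (1 - m * (k - m) / (k * var_total))


# Hypothetical data: five examinees, four items.
rows = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(kr20(rows), kr21(rows))
```

KR-21 treats all items as equally difficult, so it never exceeds KR-20 on the same data.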

Estimating Difference-Score Reliability in Pretest-Posttest Settings

Peer reviewed

Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021

Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…

Descriptors: Test Reliability, Scores, Pretests Posttests, Computation

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

A Flexible Latent Class Approach to Estimating Test-Score Reliability

Peer reviewed

van der Palm, Daniël W.; van der Ark, L. Andries; Sijtsma, Klaas – Journal of Educational Measurement, 2014

The latent class reliability coefficient (LCRC) is improved by using the divisive latent class model instead of the unrestricted latent class model. This results in the divisive latent class reliability coefficient (DLCRC), which unlike LCRC avoids making subjective decisions about the best solution and thus avoids judgment error. A computational…

Descriptors: Test Reliability, Scores, Computation, Simulation

Attribute-Level and Pattern-Level Classification Consistency and Accuracy Indices for Cognitive Diagnostic Assessment

Peer reviewed

Wang, Wenyi; Song, Lihong; Chen, Ping; Meng, Yaru; Ding, Shuliang – Journal of Educational Measurement, 2015

Classification consistency and accuracy are viewed as important indicators for evaluating the reliability and validity of classification results in cognitive diagnostic assessment (CDA). Pattern-level classification consistency and accuracy indices were introduced by Cui, Gierl, and Chang. However, the indices at the attribute level have not yet…

Descriptors: Classification, Reliability, Accuracy, Cognitive Tests

Scale Reliability Evaluation with Heterogeneous Populations

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015

A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…

Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation

Test-Retest Reliability of the Adaptive Chemistry Assessment Survey for Teachers: Measurement Error and Alternatives to Correlation

Peer reviewed

Harshman, Jordan; Yezierski, Ellen – Journal of Chemical Education, 2016

Determining the error of measurement is a necessity for researchers engaged in bench chemistry, chemistry education research (CER), and a multitude of other fields. Discussions regarding what constructs measurement error entails and how to best measure them have occurred, but the critiques about traditional measures have yielded few alternatives.…

Descriptors: Science Instruction, Chemistry, Error of Measurement, Psychometrics

Testing Methodology in the Student Learning Process

Peer reviewed
PDF on ERIC

Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017

The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…

Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation

Item Response Theory for Peer Assessment

Peer reviewed

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C-Tests

Peer reviewed

Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan – Journal of Educational Measurement, 2014

C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…

Descriptors: Comparative Analysis, Psychometrics, Cloze Procedure, Language Tests

Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

Peer reviewed

Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2013

Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

The Order-Restricted Association Model: Two Estimation Algorithms and Issues in Testing

Peer reviewed

Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004

This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…

Descriptors: Mathematics, Test Reliability, Computation, Testing

The Impact of Missing Data on Sample Reliability Estimates: Implications for Reliability Reporting Practices

Peer reviewed

Enders, Craig K. – Educational and Psychological Measurement, 2004

A method for incorporating maximum likelihood (ML) estimation into reliability analyses with item-level missing data is outlined. An ML estimate of the covariance matrix is first obtained using the expectation maximization (EM) algorithm, and coefficient alpha is subsequently computed using standard formulae. A simulation study demonstrated that…

Descriptors: Intervals, Simulation, Test Reliability, Computation
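
The second step described in this abstract, computing coefficient alpha from a covariance matrix, can be sketched with the standard formula alpha = k/(k-1) * (1 - trace(S)/sum(S)). This is only an illustration under stated assumptions: the EM estimation step is not shown, the matrix is taken as already estimated, and the function name and example values are hypothetical.

```python
def alpha_from_cov(cov):
    """Coefficient alpha from a k x k item covariance matrix (list of lists)."""
    k = len(cov)
    # Sum of item variances (the diagonal of the matrix).
    trace = sum(cov[i][i] for i in range(k))
    # Variance of the total score equals the sum of all matrix elements.
    total = sum(sum(row) for row in cov)
    return k / (k - 1) * (1 - trace / total)


# Hypothetical covariance matrix: three items, unit variances, covariances 0.5.
cov = [
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
]
print(alpha_from_cov(cov))
```

Because the function needs only the covariance matrix, any estimate of that matrix can be passed in, including one obtained from incomplete data.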

Author

Sijtsma, Klaas	2
Andersson, Björn	1
Chen, Ping	1
Ding, Shuliang	1
Emons, Wilco H. M.	1
Enders, Craig K.	1
Foster, Robert C.	1
Galindo-Garre, Francisca	1
Gorbunova, Tatiana N.	1
Gu, Zhengguo	1
Harshman, Jordan	1
Huang, Hung-Yu	1
Marcoulides, George A.	1
Meng, Yaru	1
Raykov, Tenko	1
Robitzsch, Alexander	1
Schipolowski, Stefan	1
Schroeders, Ulrich	1
Song, Lihong	1
Ueno, Maomi	1
Uto, Masaki	1
Vermunt, Jeroen K.	1
Wang, Wen-Chung	1
Wang, Wenyi	1
Xin, Tao	1