ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	13

Descriptor

Computation	14
Statistical Analysis	14
Equated Scores	6
Models	6
Item Response Theory	4
Accuracy	3
Comparative Analysis	3
Sample Size	3
Simulation	3
Bayesian Statistics	2
Difficulty Level	2
Generalizability Theory	2
Measurement Techniques	2
Rating Scales	2
Scores	2
Statistical Distributions	2
Statistical Inference	2
Test Items	2
Timed Tests	2
Transformations (Mathematics)	2
Academic Standards	1
Adaptive Testing	1
Classification	1
Computer Software	1
Correlation	1
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	13
Reports - Research	8
Reports - Evaluative	4
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Detecting Nonadditivity in Single-Facet Generalizability Theory Applications: Tukey's Test

Peer reviewed

Direct link

Lin, Chih-Kai; Zhang, Jinming – Journal of Educational Measurement, 2018

Under the generalizability-theory (G-theory) framework, the estimation precision of variance components (VCs) is of significant importance in that they serve as the foundation of estimating reliability. Zhang and Lin advanced the discussion of nonadditivity in data from a theoretical perspective and showed the adverse effects of nonadditivity on…

Descriptors: Generalizability Theory, Reliability, Computation, Statistical Analysis

Estimating the Accuracy of Relative Growth Measures Using Empirical Data

Peer reviewed

Direct link

Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020

The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…

Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods

Statistical Assessment of Estimated Transformations in Observed-Score Equating

Peer reviewed

Direct link

Wiberg, Marie; González, Jorge – Journal of Educational Measurement, 2016

Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…

Descriptors: Statistical Analysis, Equated Scores, Scores, Models

A General Linear Method for Equating with Small Samples

Peer reviewed

Direct link

Albano, Anthony D. – Journal of Educational Measurement, 2015

Research on equating with small samples has shown that methods with stronger assumptions and fewer statistical estimates can lead to decreased error in the estimated equating function. This article introduces a new approach to linear observed-score equating, one which provides flexible control over how form difficulty is assumed versus estimated…

Descriptors: Equated Scores, Sample Size, Sampling, Statistical Inference

A Comparison of IRT Proficiency Estimation Methods under Adaptive Multistage Testing

Peer reviewed

Direct link

Kim, Sooyeon; Moses, Tim; Yoo, Hanwook – Journal of Educational Measurement, 2015

This inquiry is an investigation of item response theory (IRT) proficiency estimators' accuracy under multistage testing (MST). We chose a two-stage MST design that includes four modules (one at Stage 1, three at Stage 2) and three difficulty paths (low, middle, high). We assembled various two-stage MST panels (i.e., forms) by manipulating two…

Descriptors: Comparative Analysis, Item Response Theory, Computation, Accuracy

Statistical Models and Inference for the True Equating Transformation in the Context of Local Equating

Peer reviewed

Direct link

González, B. Jorge; von Davier, Matthias – Journal of Educational Measurement, 2013

Based on Lord's criterion of equity of equating, van der Linden (this issue) revisits the so-called local equating method and offers alternative as well as new thoughts on several topics including the types of transformations, symmetry, reliability, and population invariance appropriate for equating. A remarkable aspect is to define equating…

Descriptors: Equated Scores, Statistical Analysis, Models, Statistical Inference

Situations Where It Is Appropriate to Use Frequency Estimation Equipercentile Equating

Peer reviewed

Direct link

Guo, Hongwen; Oh, Hyeonjoo J.; Eignor, Daniel – Journal of Educational Measurement, 2013

In operational equating situations, frequency estimation equipercentile equating is considered only when the old and new groups have similar abilities. The frequency estimation assumptions are investigated in this study under various situations from both the levels of theoretical interest and practical use. It shows that frequency estimation…

Descriptors: Equated Scores, Computation, Statistical Analysis, Test Items

Some Conceptual Issues in Observed-Score Equating

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2013

In spite of all of the technical progress in observed-score equating, several of the more conceptual aspects of the process still are not well understood. As a result, the equating literature struggles with rather complex criteria of equating, lack of a test-theoretic foundation, confusing terminology, and ad hoc analyses. A return to Lord's…

Descriptors: Equated Scores, Statistical Analysis, Computation, Data Collection

The Random-Effect Generalized Rating Scale Model

Peer reviewed

Direct link

Wang, Wen-Chung; Wu, Shiu-Lien – Journal of Educational Measurement, 2011

Rating scale items have been widely used in educational and psychological tests. These items require people to make subjective judgments, and these subjective judgments usually involve randomness. To account for this randomness, Wang, Wilson, and Shih proposed the random-effect rating scale model in which the threshold parameters are treated as…

Descriptors: Rating Scales, Models, Statistical Analysis, Computation

A Comparison of Item Calibration Procedures in the Presence of Test Speededness

Peer reviewed

Direct link

Suh, Youngsuk; Cho, Sun-Joo; Wollack, James A. – Journal of Educational Measurement, 2012

In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five-item calibration procedures--a two-parameter logistic (2PL) model, a…

Descriptors: Response Style (Tests), Timed Tests, Test Items, Item Response Theory

An Analysis of Variance Approach for the Estimation of Response Time Distributions in Tests

Peer reviewed

Direct link

Attali, Yigal – Journal of Educational Measurement, 2010

Generalizability theory and analysis of variance methods are employed, together with the concept of objective time pressure, to estimate response time distributions and the degree of time pressure in timed tests. By estimating response time variance components due to person, item, and their interaction, and fixed effects due to item types and…

Descriptors: Generalizability Theory, Statistical Analysis, Reaction Time, Timed Tests

Factors Affecting the Item Parameter Estimation and Classification Accuracy of the DINA Model

Peer reviewed

Direct link

de la Torre, Jimmy; Hong, Yuan; Deng, Weiling – Journal of Educational Measurement, 2010

To better understand the statistical properties of the deterministic inputs, noisy "and" gate cognitive diagnosis (DINA) model, the impact of several factors on the quality of the item parameter estimates and classification accuracy was investigated. Results of the simulation study indicate that the fully Bayes approach is most accurate when the…

Descriptors: Classification, Computation, Models, Simulation

Selection Strategies for Univariate Loglinear Smoothing Models and Their Effect on Equating Function Accuracy

Peer reviewed

Direct link

Moses, Tim; Holland, Paul W. – Journal of Educational Measurement, 2009

In this study, we compared 12 statistical strategies proposed for selecting loglinear models for smoothing univariate test score distributions and for enhancing the stability of equipercentile equating functions. The major focus was on evaluating the effects of the selection strategies on equating function accuracy. Selection strategies' influence…

Descriptors: Equated Scores, Selection, Statistical Analysis, Models

A Note on the Power of Statistical Tests in the Journal Of Educational Measurement

Peer reviewed

Brewer, James K.; Owen, Patricia W. – Journal of Educational Measurement, 1973

This note presents the results of a survey of the power of statistical tests appearing in the Journal Of Educational Measurement from winter, 1969 through fall, 1971. (Author)

Descriptors: Computation, Educational Research, Guidelines, Probability

Moses, Tim	2
Albano, Anthony D.	1
Attali, Yigal	1
Brewer, James K.	1
Castellano, Katherine E.	1
Cho, Sun-Joo	1
Deng, Weiling	1
Eignor, Daniel	1
González, B. Jorge	1
González, Jorge	1
Guo, Hongwen	1
Holland, Paul W.	1
Hong, Yuan	1
Kim, Sooyeon	1
Lin, Chih-Kai	1
McCaffrey, Daniel F.	1
Oh, Hyeonjoo J.	1
Owen, Patricia W.	1
Suh, Youngsuk	1
Wang, Wen-Chung	1
Wiberg, Marie	1
Wollack, James A.	1
Wu, Shiu-Lien	1
Yoo, Hanwook	1
Zhang, Jinming	1
More ▼