ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	4

Source

Journal of Educational…

Author

Forsyth, Robert A.	2
Albano, Anthony D.	1
Becker, Douglas F.	1
Bradshaw, Laine	1
Cook, Linda L.	1
Finch, W. Holmes	1
French, Brian F.	1
Hambleton, Ronald K.	1
Lord, Frederic M.	1
Madison, Matthew J.	1
Marco, Gary L.	1
Wright, Benjamin D.	1
van der Linden, Wim J.	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	1
Iowa Tests of Educational…	1

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Evaluating Intervention Effects in a Diagnostic Classification Model Framework

Peer reviewed

Direct link

Madison, Matthew J.; Bradshaw, Laine – Journal of Educational Measurement, 2018

The evaluation of intervention effects is an important objective of educational research. One way to evaluate the effectiveness of an intervention is to conduct an experiment that assigns individuals to control and treatment groups. In the context of pretest/posttest designed studies, this is referred to as a control-group pretest/posttest design.…

Descriptors: Intervention, Program Evaluation, Program Effectiveness, Control Groups

Multilevel Modeling of Item Position Effects

Peer reviewed

Direct link

Albano, Anthony D. – Journal of Educational Measurement, 2013

In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques

Hierarchical Logistic Regression: Accounting for Multilevel Data in DIF Detection

Peer reviewed

Direct link

French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010

The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…

Descriptors: Test Bias, Testing Programs, Evaluation, Measurement

Linking Response-Time Parameters onto a Common Scale

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational Measurement, 2010

Although response times on test items are recorded on a natural scale, the scale for some of the parameters in the lognormal response-time model (van der Linden, 2006) is not fixed. As a result, when the model is used to periodically calibrate new items in a testing program, the parameter are not automatically mapped onto a common scale. Several…

Descriptors: Test Items, Testing Programs, Measures (Individuals), Item Response Theory

Latent Trait Models and Their Use in the Analysis of Educational Test Data

Peer reviewed

Hambleton, Ronald K.; Cook, Linda L. – Journal of Educational Measurement, 1977

This article presents a non-mathematical introduction to latent trait test models and some of their features. Latent trait models are compared to classical test models. Two promising applications of latent trait models and available computer programs are discussed. (Author/JKS)

Descriptors: Computer Programs, Latent Trait Theory, Measurement, Models

Solving Measurement Problems with the Rasch Model

Peer reviewed

Wright, Benjamin D. – Journal of Educational Measurement, 1977

This article explains the Rasch model for sample-free item analysis and test-free person measurement. It shows how to estimate model parameters from data and how to evaluate the statistical fit of these estimates to the data. Attention is paid to practical considerations. (Author/JKS)

Descriptors: Computer Programs, Latent Trait Theory, Measurement, Models

A Classification Scheme for Methods of Using Student Data to Assess School Effectiveness

Peer reviewed

Marco, Gary L.; And Others – Journal of Educational Measurement, 1976

Special emphasis is given to the kinds of control that can be exercised over initial status, including the use of proxy input data. A rationale for the classification scheme is developed, based on (1) three one-shot, one cross-sectional, and two longitudinal data types and (2) two types of referencing: criterion referencing and norm referencing.…

Descriptors: Classification, Data Collection, Evaluation Methods, Methods

Practical Applications of Item Characteristic Curve Theory

Peer reviewed

Lord, Frederic M. – Journal of Educational Measurement, 1977

A variety of practical applications of item characteristic curve test theory are discussed. Among these applications are tailored testing, two stage testing, determining whether two tests measure the same latent trait, and measuring item bias towards minority or other groups. (Author/JKS)

Descriptors: Computer Programs, Latent Trait Theory, Mastery Tests, Measurement

Some Empirical Results Related to the Stability of Performance Indicators in Dyer's Student Change Model of an Educational System

Peer reviewed

Forsyth, Robert A. – Journal of Educational Measurement, 1973

Article is concerned with a model for school system evaluation. The usefulness of the indices from this model depend on their stability, and this study presents evidence related to their stability when pupils and factors related to time are considered as sources of error. (Author/RK)

Descriptors: Correlation, Educational Quality, Models, Multiple Regression Analysis

An Empirical Investigation of Thurstone and IRT Methods of Scaling Achievement Tests.

Peer reviewed

Becker, Douglas F.; Forsyth, Robert A. – Journal of Educational Measurement, 1992

Measurement scales developed using Thurstone and item-response theory (IRT) methods of scaling achievement tests for the same single-level data were compared for approximately 4,000 high school students taking the Iowa Tests of Educational Development in 1975. Results of both approaches indicate that variability increases as grade level increases.…

Descriptors: Achievement Tests, Age Differences, High School Students, High Schools

Models	10
Measurement	4
Test Construction	4
Computer Programs	3
Item Response Theory	3
Latent Trait Theory	3
Test Bias	3
Test Reliability	3
Testing Programs	3
Classification	2
Evaluation Methods	2
Program Effectiveness	2
Program Evaluation	2
Test Items	2
Achievement Tests	1
Age Differences	1
Cluster Grouping	1
College Entrance Examinations	1
Control Groups	1
Correlation	1
Data Collection	1
Diagnostic Tests	1
Educational Quality	1
Evaluation	1
Experimental Groups	1
More ▼