ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	3

Descriptor

Equated Scores	7
Statistical Analysis	7
Test Length	7
Mathematical Models	3
Sample Size	3
Test Reliability	3
Comparative Analysis	2
Correlation	2
Difficulty Level	2
Item Response Theory	2
Standardized Tests	2
Test Items	2
Ability Identification	1
Accuracy	1
Achievement Tests	1
College Entrance Examinations	1
Efficiency	1
Elementary Secondary Education	1
Essay Tests	1
Estimation (Mathematics)	1
Factor Analysis	1
Foreign Countries	1
Goodness of Fit	1
Item Analysis	1
Language Proficiency	1
More ▼

Source

Applied Measurement in…	1
ETS Research Report Series	1
Educational and Psychological…	1
Journal of Educational…	1

Author

Budescu, David	1
Huggins-Manley, Anne Corinne	1
Hutten, Leah R.	1
Lee, Won-Chan	1
Lim, Euijin	1
Livingston, Samuel A.	1
Qiu, Yuxi	1
Ricker, Kathryn L.	1
de Jong, John H. A. L.	1
von Davier, Alina A.	1

Publication Type

Reports - Research	6
Journal Articles	4
Speeches/Meeting Papers	3
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Subscore Equating and Profile Reporting

Peer reviewed

Direct link

Lim, Euijin; Lee, Won-Chan – Applied Measurement in Education, 2020

The purpose of this study is to address the necessity of subscore equating and to evaluate the performance of various equating methods for subtests. Assuming the random groups design and number-correct scoring, this paper analyzed real data and simulated data with four study factors including test dimensionality, subtest length, form difference in…

Descriptors: Equated Scores, Test Length, Test Format, Difficulty Level

Evaluating the Accuracy of the Empirical Item Characteristic Curve Preequating Method in the Presence of Test Speededness

Peer reviewed

Direct link

Qiu, Yuxi; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019

This study aimed to assess the accuracy of the empirical item characteristic curve (EICC) preequating method given the presence of test speededness. The simulation design of this study considered the proportion of speededness, speededness point, speededness rate, proportion of missing on speeded items, sample size, and test length. After crossing…

Descriptors: Accuracy, Equated Scores, Test Items, Nonparametric Statistics

The Impact of Anchor Test Length on Equating Results in a Nonequivalent Groups Design. Research Report. ETS RR-07-44

Peer reviewed
PDF on ERIC

Download full text

Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007

This study explored the effects of external anchor test length on final equating results of several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification PSE with optimal bandwidths, and KE PSE linear (large bandwidths) when using the nonequivalent groups anchor test…

Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length

Efficiency of Linear Equating as a Function of the Length of the Anchor Test.

Peer reviewed

Budescu, David – Journal of Educational Measurement, 1985

An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)

Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Estimating the Reliability of Classifications Based on Composite Scores.

Download full text

Livingston, Samuel A. – 1984

Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…

Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models

Tailoring Tests to Educational Levels.

Download full text

de Jong, John H. A. L. – 1984

The Netherlands' secondary education system is highly differentiated, with four different school types for four scholastic ability levels. Final examinations must accommodate these four levels, and require a test-independent definition of the intended final ability levels as well as a sample-free evaluation of the range of ability levels at which…

Descriptors: Difficulty Level, Efficiency, Equated Scores, Foreign Countries

A Comparison of the Fit of Empirical Data to Two Latent Trait Models. Report No. 92.

Hutten, Leah R. – 1979

Goodness of fit of raw test score data were compared, using two latent trait models: the Rasch model and the Birnbaum three-parameter logistic model. Data were taken from various achievement tests and the Scholastic Aptitude Test (Verbal). A minimum sample size of 1,000 was required, and the minimum test length was 40 items. Results indicated that…

Descriptors: Ability Identification, Achievement Tests, College Entrance Examinations, Comparative Analysis