ERIC - Search Results

Showing all 8 results

Impacts of Differences in Group Abilities and Anchor Test Features on Three Non-IRT Test Equating Methods

Peer reviewed

Inga Laukaityte; Marie Wiberg – Practical Assessment, Research & Evaluation, 2024

The overall aim was to examine effects of differences in group ability and features of the anchor test form on equating bias and the standard error of equating (SEE) using both real and simulated data. Chained kernel equating, Poststratification kernel equating, and Circle-arc equating were studied. A college admissions test with four different…

Descriptors: Ability Grouping, Test Items, College Entrance Examinations, High Stakes Tests

FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions

Peer reviewed

Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018

This study investigated the effects of linking potentially multidimensional test forms using fixed item parameter calibration (FIPC). Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…

Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

Study of the Feasibility of a NAEP Mathematics Accessible Block Alternative

DeStefano, Lizanne; Johnson, Jeremiah – American Institutes for Research, 2013

This paper describes one of the first efforts by the National Assessment of Educational Progress (NAEP) to improve measurement at the lower end of the distribution, including measurement for students with disabilities (SD) and English language learners (ELLs). One way to improve measurement at the lower end is to introduce one or more…

Descriptors: National Competency Tests, Measures (Individuals), Disabilities, English Language Learners

Equating Multiple Tests via an IRT Linking Design: Utilizing a Single Set of Anchor Items with Fixed Common Item Parameters during the Calibration Process.

Li, Yuan H.; Griffith, William D.; Tam, Hak P. – 1997

This study explores the relative merits of a potentially useful item response theory (IRT) linking design: using a single set of anchor items with fixed common item parameters (FCIP) during the calibration process. An empirical study was conducted to investigate the appropriateness of this linking design using 6 groups of students taking 6 forms…

Descriptors: Ability, Difficulty Level, Equated Scores, Error of Measurement

A Comparison of Two, Three and Four-Choice Item Tests Given a Fixed Total Number of Choices.

Peer reviewed

Straton, Ralph G.; Catts, Ralph M. – Educational and Psychological Measurement, 1980

Multiple-choice tests composed entirely of two-, three-, or four-choice items were investigated. Results indicated that the number of alternatives per item was inversely related to item difficulty, but directly related to item discrimination. Reliability and standard error of measurement of three-choice item tests were equivalent or superior.…

Descriptors: Difficulty Level, Error of Measurement, Foreign Countries, Higher Education

A Multivariate Generalizability Analysis of the 1989 and 1990 AAP Mathematics Test Forms with Respect to the Table of Specifications.

Colton, Dean A. – 1993

Tables of specifications are used to guide test developers in sampling items and maintaining consistency from form to form. This paper is a generalizability study of the American College Testing Program (ACT) Achievement Program Mathematics Test (AAP), with the content areas of the table of specifications representing multiple dependent variables.…

Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Generalizability Theory

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)