ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	5

Descriptor

Evaluation Methods	9
Item Analysis	9
Testing Programs	9
Item Response Theory	4
Standardized Tests	3
Test Items	3
Test Results	3
Accountability	2
Achievement Tests	2
Criterion Referenced Tests	2
Educational Objectives	2
Equated Scores	2
Evaluation Problems	2
Language Arts	2
Mathematics Tests	2
Models	2
Program Descriptions	2
School Districts	2
State Programs	2
Test Reviews	2
Testing	2
Ability Grouping	1
Audits (Verification)	1
Basic Skills	1
Board of Education Policy	1
More ▼

Source

Applied Measurement in…	2
ACT, Inc.	1
International Journal of…	1
Journal of Educational…	1
Quest	1

Author

Albano, Anthony D.	1
Brian F. French	1
Carlson, Janet F.	1
Chen, Hanwei	1
Cui, Zhongmin	1
Gao, Xiaohong	1
Geisinger, Kurt F.	1
Grosswald, Jules	1
Keating, Xiaofen Deng	1
Phillips, Gary W.	1
Thao Thu Vo	1
Tony Albano	1
Wahlstrom, Merlin W.	1
Wall, Janet	1
Zhu, Rongchun	1
More ▼

Publication Type

Reports - Research	6
Journal Articles	5
Reports - Evaluative	2
Numerical/Quantitative Data	1
Reports - Descriptive	1

Education Level

Adult Education	1
Elementary Secondary Education	1
Grade 11	1
High Schools	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Canada	1
Delaware	1

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Direct link

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Multilevel Modeling of Item Position Effects

Peer reviewed

Direct link

Albano, Anthony D. – Journal of Educational Measurement, 2013

In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Test Reviewing at the Buros Center for Testing

Peer reviewed

Direct link

Carlson, Janet F.; Geisinger, Kurt F. – International Journal of Testing, 2012

The test review process used by the Buros Center for Testing is described as a series of 11 steps: (1) identifying tests to be reviewed, (2) obtaining tests and preparing test descriptions, (3) determining whether tests meet review criteria, (4) identifying appropriate reviewers, (5) selecting reviewers, (6) sending instructions and materials to…

Descriptors: Testing, Test Reviews, Evaluation Methods, Evaluation Criteria

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Download full text

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

Large-Scale Standardized Testing Programs--New Vistas in User Oriented Reporting.

Download full text

Grosswald, Jules – 1975

Much of the intrinsic wealth of planning and instructional information available from achievement testing programs goes untapped in typical reporting procedures. Large-scale programs reporting only pupil scores and the results of aggregating those scores stop far short of the purposes intended and fail to realize the potential of such information.…

Descriptors: Achievement Tests, Data Analysis, Decision Making, Evaluation Methods

The Current Often Implemented Fitness Tests in Physical Education Programs: Problems and Future Directions

Peer reviewed

Direct link

Keating, Xiaofen Deng – Quest, 2003

This paper aims to examine current nationwide youth fitness test programs, address problems embedded in the programs, and possible solutions. The current Fitnessgram, President's Challenge, and YMCA youth fitness test programs were selected to represent nationwide youth fitness test programs. Sponsors of the nationwide youth fitness test programs…

Descriptors: Physical Education, Test Items, Physical Fitness, Youth Programs

The Objective-Referenced Testing Component of the Delaware Educational Accountability System., Grade Four-Communications.

Download full text

Wall, Janet – 1978

In September 1977 a reading and language arts test, based upon 40 objectives slated for student accomplishment by the end of grade four, was administered to all Delaware fourth graders. This test, the Objective-Referenced Measure in Communications, was intended to provide entry level diagnostic information to classroom teachers and to inform…

Descriptors: Accountability, Criterion Referenced Tests, Educational Assessment, Educational Objectives

Measuring Achievement at the Primary and Junior Levels: An Analytical Review of Test Instruments Used in Evaluating Pupil Achievement and of Communicating Results to Parents.

Wahlstrom, Merlin W.; And Others – 1977

Evaluation instruments and current practices used in measuring achievement in Ontario, Canada elementary schools were examined. An analytical review of currently available achievement measures was made in relation to the objectives of "The Formative Years", a circular issued on the Ontario Minestry on Education. Results of item analyses…

Descriptors: Achievement Tests, Basic Skills, Board of Education Policy, Case Studies