Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Evaluation Methods | 9 |
| Item Analysis | 9 |
| Testing Programs | 9 |
| Item Response Theory | 4 |
| Standardized Tests | 3 |
| Test Items | 3 |
| Test Results | 3 |
| Accountability | 2 |
| Achievement Tests | 2 |
| Criterion Referenced Tests | 2 |
| Educational Objectives | 2 |
| More ▼ | |
Author
| Albano, Anthony D. | 1 |
| Brian F. French | 1 |
| Carlson, Janet F. | 1 |
| Chen, Hanwei | 1 |
| Cui, Zhongmin | 1 |
| Gao, Xiaohong | 1 |
| Geisinger, Kurt F. | 1 |
| Grosswald, Jules | 1 |
| Keating, Xiaofen Deng | 1 |
| Phillips, Gary W. | 1 |
| Thao Thu Vo | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 5 |
| Reports - Evaluative | 2 |
| Numerical/Quantitative Data | 1 |
| Reports - Descriptive | 1 |
Education Level
| Adult Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 11 | 1 |
| High Schools | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Carlson, Janet F.; Geisinger, Kurt F. – International Journal of Testing, 2012
The test review process used by the Buros Center for Testing is described as a series of 11 steps: (1) identifying tests to be reviewed, (2) obtaining tests and preparing test descriptions, (3) determining whether tests meet review criteria, (4) identifying appropriate reviewers, (5) selecting reviewers, (6) sending instructions and materials to…
Descriptors: Testing, Test Reviews, Evaluation Methods, Evaluation Criteria
Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010
The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level
Grosswald, Jules – 1975
Much of the intrinsic wealth of planning and instructional information available from achievement testing programs goes untapped in typical reporting procedures. Large-scale programs reporting only pupil scores and the results of aggregating those scores stop far short of the purposes intended and fail to realize the potential of such information.…
Descriptors: Achievement Tests, Data Analysis, Decision Making, Evaluation Methods
Keating, Xiaofen Deng – Quest, 2003
This paper aims to examine current nationwide youth fitness test programs, address problems embedded in the programs, and possible solutions. The current Fitnessgram, President's Challenge, and YMCA youth fitness test programs were selected to represent nationwide youth fitness test programs. Sponsors of the nationwide youth fitness test programs…
Descriptors: Physical Education, Test Items, Physical Fitness, Youth Programs
Wall, Janet – 1978
In September 1977 a reading and language arts test, based upon 40 objectives slated for student accomplishment by the end of grade four, was administered to all Delaware fourth graders. This test, the Objective-Referenced Measure in Communications, was intended to provide entry level diagnostic information to classroom teachers and to inform…
Descriptors: Accountability, Criterion Referenced Tests, Educational Assessment, Educational Objectives
Wahlstrom, Merlin W.; And Others – 1977
Evaluation instruments and current practices used in measuring achievement in Ontario, Canada elementary schools were examined. An analytical review of currently available achievement measures was made in relation to the objectives of "The Formative Years", a circular issued on the Ontario Minestry on Education. Results of item analyses…
Descriptors: Achievement Tests, Basic Skills, Board of Education Policy, Case Studies

Peer reviewed
Direct link
