Publication Date
| In 2026 | 0 |
| Since 2025 | 9 |
| Since 2022 (last 5 years) | 72 |
| Since 2017 (last 10 years) | 173 |
| Since 2007 (last 20 years) | 395 |
Descriptor
Source
Author
| Sireci, Stephen G. | 9 |
| Kitao, Kenji | 4 |
| Kitao, S. Kathleen | 4 |
| Papageorgiou, Spiros | 4 |
| Thurlow, Martha L. | 4 |
| Winnick, Joseph P. | 4 |
| van der Linden, Wim J. | 4 |
| Chang, Hua-Hua | 3 |
| Donovan, Jenny | 3 |
| Ewing, Maureen | 3 |
| Hau, Kit-Tai | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Teachers | 68 |
| Practitioners | 59 |
| Administrators | 20 |
| Students | 15 |
| Policymakers | 9 |
| Researchers | 7 |
| Parents | 6 |
| Counselors | 3 |
| Community | 2 |
| Support Staff | 1 |
Location
| Australia | 18 |
| California | 15 |
| Canada | 14 |
| China | 13 |
| United States | 12 |
| Massachusetts | 9 |
| United Kingdom | 9 |
| Europe | 8 |
| Georgia | 8 |
| Japan | 8 |
| Rhode Island | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Content Characteristics of GRE Analytical Reasoning Items. GRE Board Professional Report No. 84-14P.
Chalifour, Clark; Powers, Donald E. – 1988
In actual test development practice, the number of test items that must be developed and pretested is typically greater, and sometimes much greater, than the number eventually judged suitable for use in operational test forms. This has proven to be especially true for analytical reasoning items, which currently form the bulk of the analytical…
Descriptors: Coding, Difficulty Level, Higher Education, Test Construction
Malcolm, Donald J. – 1992
Various memoranda concerning language test development procedures and technical operations are compiled for staff at the Kuwait University Language Center from the Office of Tests and Measurement. The memoranda are of interest to Unit Test Representatives but also are intended to provide guidance to unit supervisors, course coordinators, and…
Descriptors: Foreign Countries, Higher Education, Language Tests, Standards
White, Charles E., Jr. – 1993
The purpose of this study was to develop and implement a hypertext documentation system in an industrial laboratory and to evaluate its usefulness by participative observation and a questionnaire. Existing word-processing test method documentation was converted directly into a hypertext format or "hyperdocument." The hyperdocument was designed and…
Descriptors: Chemical Industry, Documentation, Hypermedia, Participant Observation
Peer reviewedFeldt, Leonard S. – Applied Measurement in Education, 2002
Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Descriptors: Error of Measurement, Reliability, Scores, Test Construction
Peer reviewedDavis, Laurie Laughlin; Pastor, Dena A.; Dodd, Barbara G.; Chiang, Claire; Fitzpatrick, Steven J. – Journal of Applied Measurement, 2003
Examined the effectiveness of the Sympson-Hetter technique and rotated content balancing relative to no exposure control and no content rotation conditions in a computerized adaptive testing system based on the partial credit model. Simulation results show the Sympson-Hetter technique can be used with minimal impact on measurement precision,…
Descriptors: Adaptive Testing, Computer Assisted Testing, Selection, Simulation
Peer reviewedLeMahieu, Paul G. – NASSP Bulletin, 1992
The value of assessment activities should be judged by their contribution to what happens, directly or indirectly, between teachers and students. No one assessment can serve all ends. Alternative forms of assessment measure outcomes beyond the purview of traditional measures and are more authentic efforts to represent behavior or accomplishments…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Student Evaluation, Test Construction
Peer reviewedRaymond, Mark R. – Applied Measurement in Education, 2001
Reviews general approaches to job analysis and considers methodological issues related to sampling and the development of rating scales used to measure and describe a profession or occupation. Evaluates the usefulness of different types of test plans and describes judgmental and empirical methods for using practice analysis data to help develop…
Descriptors: Certification, Job Analysis, Licensing Examinations (Professions), Rating Scales
Duckworth, Jane C.; Anderson, Wayne P. – 1986
This manual presents information on the Minnesota Multiphasic Personality Inventory (MMPI), primarily directed to counselors and clinicians who work with university counseling center clients, private practice clients, and mental health clinic clients who are not usually psychotic or neurotic but are having difficulties in one or two areas. This…
Descriptors: Higher Education, Personality Measures, Test Content, Test Interpretation
Childs, Ruth A.; Jaciw, Andrew P. – 2003
This Digest describes matrix sampling of test items as an approach to achieving broad coverage while minimizing testing time per student. Matrix sampling involves developing a complete set of items judged to cover the curriculum, then dividing the items into subsets and administering one subset to each student. Matrix sampling, by limiting the…
Descriptors: Item Banks, Matrices, Sampling, Test Construction
Georgia State Dept. of Education, Atlanta. – 1999
This document contains a description of the Georgia High School Graduation Test in mathematics. The test item specifications, reflecting the Georgia State Quality Core Curriculum, are used by writers and reviewers who are responsible for the development of test items. Much of the content in the description is based on earlier test versions…
Descriptors: High School Students, High Schools, Mathematics, Standardized Tests
Leung, Chi-Keung; Chang, Hua-Hua; Hau, Kit-Tai – 2000
Item selection methods in computerized adaptive testing (CAT) can yield extremely skewed item exposure distribution in which items with high "a" values may be over-exposed while those with low "a" values may never be selected. H. Chang and Z. Ying (1999) proposed the a-stratified design (ASTR) that attempts to equalize item…
Descriptors: Adaptive Testing, Computer Assisted Testing, Selection, Test Construction
Michigan State Dept. of Education, Lansing. Michigan Educational Assessment Program. – 1998
Designed to provide an experience as close as possible to the actual assessment, this paper presents the revised model of assessment in reading for the Michigan High School Test (HST). The revisions incorporated into the paper reflect the testing transition from the High School Proficiency Test to the HST. The first part of the paper presents…
Descriptors: High Schools, Reading Achievement, Reading Skills, Reading Tests
Walker, Karen – Education Partnerships, Inc., 2006
What are the purposes of exams? Would teachers give exams if they were not grading students? Exams provide information that should inform the instructional program, let the students know their strengths as well as areas for growth, and tell teachers what information their students know and what they still need to know. When determining how…
Descriptors: Tests, Test Items, Test Construction, Teacher Made Tests
Peer reviewedLeung, Chi-Keung; Chang, Hua-Hua; Hau, Kit-Tai – Educational and Psychological Measurement, 2003
Studied three stratification designs for computerized adaptive testing in conjunction with three well-developed content balancing methods. Simulation study results show substantial differences in item overlap rate and pool utilization among different methods. Recommends an optimal combination of stratification design and content balancing method.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Simulation
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design

Direct link
