Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 37 |
Descriptor
| Educational Assessment | 63 |
| Evaluation Problems | 63 |
| Evaluation Methods | 45 |
| Student Evaluation | 33 |
| Educational Testing | 26 |
| Measurement | 25 |
| Psychometrics | 22 |
| Measurement Techniques | 21 |
| Testing Problems | 20 |
| Academic Achievement | 19 |
| Test Validity | 18 |
| More ▼ | |
Source
Author
| Thurlow, Martha | 3 |
| Bagnato, Stephen J. | 2 |
| Bielinski, John | 2 |
| Ferrara, Steve | 2 |
| Macy, Marisa | 2 |
| Minnema, Jane | 2 |
| Alonzo, Alicia C. | 1 |
| Anderson, Patricia S. | 1 |
| Applegate, Brooks | 1 |
| Baker, Eva L. | 1 |
| Baldwin, Su G. | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 2 |
| Policymakers | 1 |
| Researchers | 1 |
Location
| New York | 3 |
| Florida | 2 |
| Alaska | 1 |
| Australia | 1 |
| California | 1 |
| Canada | 1 |
| Connecticut | 1 |
| Georgia | 1 |
| Hawaii | 1 |
| Idaho | 1 |
| Illinois | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 5 |
| Individuals with Disabilities… | 1 |
Assessments and Surveys
| Advanced Placement… | 1 |
| Florida Comprehensive… | 1 |
| Iowa Tests of Educational… | 1 |
| Medical College Admission Test | 1 |
| National Assessment of… | 1 |
| Pediatric Evaluation of… | 1 |
| Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Imlig, Flavian; Ender, Susanne – Assessment in Education: Principles, Policy & Practice, 2018
This article reveals three emerging areas of conflict in the use of educational assessment instruments in compulsory education in Switzerland and outlines an analytical approach for detecting and analysing these areas of conflict. The approach combines a conceptual perspective, an evaluation perspective and a teaching perspective to show the…
Descriptors: Foreign Countries, Conflict, Educational Assessment, Compulsory Education
Haertel, Edward – Teachers College Record, 2014
Background:This brief reflection on the work of the Gordon Commission calls out significant themes and implications found in the various papers authored by the commissioners and other scholars, especially those included in this special issue of Teachers College Record. Purpose: The forward-looking vision of the Gordon Commission is contrasted with…
Descriptors: Evaluation, Evaluation Methods, Evaluation Needs, Planning Commissions
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Russell, Michael; Kavanaugh, Maureen; Masters, Jessica; Higgins, Jennifer; Hoffmann, Thomas – Journal of Applied Testing Technology, 2009
Many students who are deaf or hard-of-hearing are eligible for a signing accommodation for state and other standardized tests. The signing accommodation, however, presents several challenges for testing programs that attempt to administer tests under standardized conditions. One potential solution for many of these challenges is the use of…
Descriptors: Testing Programs, Student Attitudes, Standardized Tests, Academic Achievement
Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009
Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…
Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability
Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007
The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…
Descriptors: Educational Testing, Tests, Measurement, Faculty Development
Minnema, Jane; Thurlow, Martha; Bielinski, John – 2002
Two focus groups of test and measurement experts were held to explore the use of out-of-level testing for students with disabilities. The participants (n=17) included state and federal level assessment personnel, test company employees, and university professors. A content analysis of the narrative results indicated that there was no clear…
Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Disabilities
Hughes, Katherine L.; Scott-Clayton, Judith – Community College Research Center, Columbia University, 2010
Placement exams are high-stakes assessments that determine many students' college trajectories. More than half of entering students at community colleges are placed into developmental education in at least one subject, based primarily on scores from these assessments, yet recent research fails to find evidence that placement into remediation…
Descriptors: Community Colleges, Remedial Instruction, Literature Reviews, High Stakes Tests
Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…
Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement
Bielinski, John; Thurlow, Martha; Minnema, Jane; Scott, Jim – 2002
In this study, special education teachers identified students with learning disabilities who were working on math skills usually taught two grades below the grade in which the student was enrolled. Each student (n=33) took two levels of the MAT/7 math computation test, an on-grade test, and an out-of-level test intended for students two grades…
Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Educational Assessment

Peer reviewed
Direct link
