Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 19 |
Descriptor
Source
Author
| Bagnato, Stephen J. | 2 |
| Macy, Marisa | 2 |
| Baldwin, Su G. | 1 |
| Bechger, Timo | 1 |
| Beresford, Lauren | 1 |
| Carstensen, Claus H. | 1 |
| Cheng, Liying | 1 |
| Clauser, Brian E. | 1 |
| Collins, Dave | 1 |
| Cui, Ying | 1 |
| Davidson, Anne H. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 27 |
| Opinion Papers | 10 |
| Reports - Evaluative | 8 |
| Reports - Research | 7 |
| Reports - Descriptive | 5 |
Education Level
Audience
| Policymakers | 1 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Australia | 1 |
| Canada | 1 |
| Denmark | 1 |
| Florida | 1 |
| Germany | 1 |
| Ghana | 1 |
| Japan | 1 |
| United Kingdom | 1 |
| United Kingdom (Scotland) | 1 |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Advanced Placement… | 1 |
| Florida Comprehensive… | 1 |
| Pediatric Evaluation of… | 1 |
| Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Russell, Javarro; Markle, Ross – ETS Research Report Series, 2017
From 2006 to 2008, Educational Testing Service (ETS) produced a series of reports titled "A Culture of Evidence," designed to capture a changing climate in higher education assessment. A decade later, colleges and universities already face new and different challenges resulting from societal, technological, and scientific influences.…
Descriptors: Evidence Based Practice, Evidence, Educational Testing, Educational Improvement
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Ferrara, Steve; Svetina, Dubravka; Skucha, Sylvia; Davidson, Anne H. – Educational Measurement: Issues and Practice, 2011
Items on test score scales located at and below the Proficient cut score define the content area knowledge and skills required to achieve proficiency. Alternately, examinees who perform at the Proficient level on a test can be expected to be able to demonstrate that they have mastered most of the knowledge and skills represented by the items at…
Descriptors: Knowledge Level, Mathematics Tests, Program Effectiveness, Inferences
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…
Descriptors: Test Items, Classification, Psychometrics, Item Response Theory
Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009
Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…
Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Bagnato, Stephen J.; Macy, Marisa – NHSA Dialog, 2010
Authentic assessment is a growing alternative to conventional testing. This research-to-practice article describes a framework for implementing authentic assessment. The R-E-A-L framework shows how roles, equipment, assessment tools, and location can be incorporated into early childhood practices.
Descriptors: Early Childhood Education, Performance Based Assessment, Program Implementation, Guidelines
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Oteng-Ababio, M. – Turkish Online Journal of Distance Education, 2011
Distance Education has globally become one of the important solutions for increasing admission into the universities, decongesting campuses and efficient utilization of time and space. To ensure the sustainability of the programmes' noble objectives calls for periodic re-evaluation of its modus operandi including the assessment of the perception…
Descriptors: Foreign Countries, Student Attitudes, Distance Education, Negative Attitudes
Macy, Marisa; Bagnato, Stephen J. – NHSA Dialog, 2010
The inclusion of young children with disabilities has remained a function of the Head Start program since its inception in the 1960s when the United States Congress mandated that children with disabilities comprise 10% of the Head Start enrollment (Zigler & Styfco, 2000). Standardized, norm-referenced tests used to identify children with…
Descriptors: Performance Based Assessment, Disadvantaged Youth, Norm Referenced Tests, Disabilities
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
