ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	34

Descriptor

Educational Assessment	37
Evaluation Problems	37
Measurement	37
Evaluation Methods	34
Measurement Techniques	24
Psychometrics	23
Evaluation Research	14
Student Evaluation	13
Teacher Evaluation	13
Educational Testing	12
Models	12
Test Validity	12
Testing Problems	12
Test Construction	10
Knowledge Base for Teaching	9
Mathematics Education	9
Mathematics Instruction	9
Pedagogical Content Knowledge	9
Evidence	8
Academic Achievement	7
Item Response Theory	7
Diagnostic Tests	6
Educational Policy	6
Educational Research	6
Program Effectiveness	6
More ▼

Source

Measurement:…	14
Education Finance and Policy	3
Journal of Applied Testing…	3
Journal of Educational…	3
School Psychology Review	3
National Center for Analysis…	2
ECNU Review of Education	1
Educational Assessment,…	1
Educational Measurement:…	1
Educational Psychology: An…	1
International Journal of…	1
Measurement in Physical…	1
Scholar-Practitioner Quarterly	1
TNTP	1
Teaching in Higher Education	1
More ▼

Publication Type

Journal Articles	34
Opinion Papers	16
Reports - Descriptive	10
Reports - Evaluative	6
Reports - Research	4
Information Analyses	1
Reports - General	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	27
Elementary Education	6
Higher Education	2
Adult Education	1
Grade 4	1
Grade 5	1
Grade 8	1
Middle Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Practitioners

Location

New York	2
California	1
Florida	1
Illinois	1
New Jersey	1
North Carolina	1
Tennessee	1
Texas	1
United Kingdom (England)	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Advanced Placement…

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Tackling the Wicked Problem of Measuring What Matters: Framing the Questions

Peer reviewed
PDF on ERIC

Download full text

Direct link

Zhao, Yong; Wehmeyer, Michael; Basham, James; Hansen, David – ECNU Review of Education, 2019

Purpose: Making policy makers, researcher, education leaders, and assessment developers aware that what matters in education assessment is a wicked problem that cannot be easily solved following traditional approaches. Design/Approach/Methods: Starting from the questions that what matters in education assessment, this article presented such…

Descriptors: Educational Assessment, Evaluation Problems, Outcomes of Education, Cooperation

Myths & Facts about Value-Added Analysis

Download full text

TNTP, 2011

This paper presents myths as well as facts about value-added analysis. These myths include: (1) "Value-added isn't fair to teachers who work in high-need schools, where students tend to lag far behind academically"; (2) "Value-added scores are too volatile from year-to-year to be trusted"; (3) "There's no research behind value-added"; (4) "Using…

Descriptors: Academic Achievement, Standardized Tests, Teacher Evaluation, Evaluation Methods

PE Metrics: Background, Testing Theory, and Methods

Peer reviewed

Direct link

Zhu, Weimo; Rink, Judy; Placek, Judith H.; Graber, Kim C.; Fox, Connie; Fisette, Jennifer L.; Dyson, Ben; Park, Youngsik; Avery, Marybell; Franck, Marian; Raynes, De – Measurement in Physical Education and Exercise Science, 2011

New testing theories, concepts, and psychometric methods (e.g., item response theory, test equating, and item bank) developed during the past several decades have many advantages over previous theories and methods. In spite of their introduction to the field, they have not been fully accepted by physical educators. Further, the manner in which…

Descriptors: Physical Education, Quality Control, Psychometrics, Item Response Theory

Diagnostic Classification Models: Which One Should I Use?

Peer reviewed

Direct link

Jiao, Hong – Measurement: Interdisciplinary Research and Perspectives, 2009

Diagnostic assessment is currently an active research area in educational measurement. Literature related to diagnostic modeling has been in existence for several decades, but a great deal of research has been conducted within the last decade or so, especially within the last five years. The author summarizes the key components in the application…

Descriptors: Educational Assessment, Literature Reviews, Test Items, Probability

Value-Added Measures for Schools in England: Looking inside the "Black Box" of Complex Metrics

Peer reviewed

Direct link

Kelly, Anthony; Downey, Christopher – Educational Assessment, Evaluation and Accountability, 2010

Value-added measures can be used to allocate funding to schools, to identify those institutions in need of special attention and to underpin government guidance on targets. In England, there has been a tendency to include in these measures an ever-greater number of contextualising variables and to develop ever-more complex models that encourage…

Descriptors: School Effectiveness, Foreign Countries, Academic Achievement, Educational Finance

Learning Outcomes: A Conceptual Analysis

Peer reviewed

Direct link

Hussey, Trevor; Smith, Patrick – Teaching in Higher Education, 2008

Learning outcomes have become widely used in higher education, but also misused to the point of being controversial and a bureaucratic burden. This paper distinguishes three kinds of learning outcome found in current literature: (1) those used in individual teaching events; (2) those specified for modules or short courses; and (3) those specified…

Descriptors: Higher Education, Outcomes of Education, Educational Assessment, Evaluation Methods

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

Student Sorting and Bias in Value-Added Estimation: Selection on Observables and Unobservables

Peer reviewed

Direct link

Rothstein, Jesse – Education Finance and Policy, 2009

Nonrandom assignment of students to teachers can bias value-added estimates of teachers' causal effects. Rothstein (2008, 2010) shows that typical value-added models indicate large counterfactual effects of fifth-grade teachers on students' fourth-grade learning, indicating that classroom assignments are far from random. This article quantifies…

Descriptors: Grade 5, Academic Achievement, Student Placement, Educational Assessment

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Diagnostic Models as Partially Ordered Sets

Peer reviewed

Direct link

Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…

Descriptors: Test Items, Classification, Psychometrics, Item Response Theory

Equivalent Diagnostic Classification Models

Peer reviewed

Direct link

Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009

Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…

Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Impediments to the Estimation of Teacher Value Added

Peer reviewed

Direct link

Ishii, Jun; Rivkin, Steven G. – Education Finance and Policy, 2009

This article considers potential impediments to the estimation of teacher quality caused primarily by the purposeful behavior of families, administrators, and teachers. The discussion highlights the benefits of accounting for student and school differences through a value-added modeling approach that incorporates a student's history of family,…

Descriptors: Teacher Effectiveness, Educational Quality, Barriers, Student Characteristics

Previous Page | Next Page »

Pages: 1 | 2 | 3

Rivkin, Steven G.	2
Alonzo, Alicia C.	1
Avery, Marybell	1
Baldwin, Su G.	1
Basham, James	1
Bechger, Timo	1
Boyd, Donald	1
Buckle, C. F.	1
Camara, Wayne	1
Carstensen, Claus H.	1
Clauser, Brian E.	1
Cline, Frederick	1
Cook, Linda	1
Cui, Ying	1
DiBello, Lou	1
Dillon, Gerard F.	1
Downey, Christopher	1
Dyson, Ben	1
Easton, Julia E.	1
Engelhard, George, Jr.	1
Ferrara, Steve	1
Fisette, Jennifer L.	1
Fox, Connie	1
Franck, Marian	1
Frey, Andreas	1
More ▼