ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	10

Descriptor

Educational Testing	16
Evaluation Problems	16
Testing Problems	16
Evaluation Methods	10
Student Evaluation	9
Educational Assessment	7
Psychometrics	7
Measurement	6
Test Validity	6
Evaluation Research	5
Educational Policy	4
Elementary Secondary Education	4
Test Construction	4
Academic Standards	3
Foreign Countries	3
Mathematics Education	3
Standard Setting (Scoring)	3
Standardized Tests	3
Teacher Evaluation	3
Academic Achievement	2
Adaptive Testing	2
Adult Students	2
Content Validity	2
Correlation	2
Criterion Referenced Tests	2
More ▼

Source

Journal of Educational…	3
Measurement:…	2
American Educational Research…	1
British Educational Research…	1
National Research Center on…	1
Research in Education	1
Teachers College Record	1
Thomas B. Fordham Institute	1
Turkish Online Journal of…	1

Publication Type

Journal Articles	10
Reports - Research	6
Reports - Evaluative	4
Opinion Papers	3
Reports - Descriptive	2
ERIC Digests in Full Text	1
ERIC Publications	1
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	7
Adult Education	1
Elementary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Higher Education	1
Junior High Schools	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Arizona	1
California	1
Colorado	1
Delaware	1
Ghana	1
Idaho	1
Illinois	1
Indiana	1
Japan	1
Kansas	1
Maine	1
Maryland	1
Massachusetts	1
Michigan	1
Minnesota	1
Montana	1
Nevada	1
New Hampshire	1
New Jersey	1
New Mexico	1
North Dakota	1
Ohio	1
Rhode Island	1
South Carolina	1
Texas	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Advanced Placement…	1
Stanford Achievement Tests	1

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Social Epistemology and the Pragmatics of Assessment

Peer reviewed

Direct link

Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014

In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…

Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization

The Leading Group Effect: Illusionary Declines in Scholastic Standard Scores of Mid-Range Japanese Junior High School Pupils

Peer reviewed

Direct link

Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012

Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…

Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems

Different Tests, Different Answers: The Stability of Teacher Value-Added Estimates across Outcome Measures

Peer reviewed

Direct link

Papay, John P. – American Educational Research Journal, 2011

Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…

Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

The Hierarchy Consistency Index: Evaluating Person Fit for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009

In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…

Descriptors: Test Length, Simulation, Correlation, Research Methodology

Door of Hope or Despair: Students' Perception of Distance Education at University of Ghana

Peer reviewed
PDF on ERIC

Download full text

Oteng-Ababio, M. – Turkish Online Journal of Distance Education, 2011

Distance Education has globally become one of the important solutions for increasing admission into the universities, decongesting campuses and efficient utilization of time and space. To ensure the sustainability of the programmes' noble objectives calls for periodic re-evaluation of its modus operandi including the assessment of the perception…

Descriptors: Foreign Countries, Student Attitudes, Distance Education, Negative Attitudes

Look Back in Angoff: A Cautionary Tale.

Peer reviewed

Sizmur, Steve – British Educational Research Journal, 1997

Examines the appropriateness of a cut-off score derived from the Angoff procedure for a reading test in the United Kingdom. Shows that the recommended cut-off score is too low. Suggests ways that standard setting might draw on a range of information to produce appropriate and rationally defensible cut-off scores. (DSK)

Descriptors: Academic Achievement, British National Curriculum, Educational Testing, Elementary Secondary Education

A Second Look at Evaluation for Adult Learners.

Zalon, Margarete Lieb – 1991

Certain types of evaluation tools used in continuing education offerings limit the evaluation process and hence the value of the offering. Asking a yes/no question as an exclusive evaluation method for learners' achievement of objectives is inappropriate for certain types of offerings. Provision should be made for assessment of continuing…

Descriptors: Academic Achievement, Adult Learning, Adult Students, Continuing Education

Validating the MKT Measures: Some Responses to the Commentaries

Peer reviewed

Direct link

Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007

The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…

Descriptors: Educational Testing, Tests, Measurement, Faculty Development

Can Performance-Based Assessments Improve Urban Schooling? ERIC Digest Number 56.

Download full text

Ascher, Carol – 1990

Performance-based assessment has the potential to support a richer curriculum and more accurately assess the skills of low-income minority students than standardized tests. Performance-based assessment has the following advantages: (1) it allows a wide range of expression; (2) it permits assessment of learning in a natural context while students…

Descriptors: Disadvantaged Youth, Educational Innovation, Educational Testing, Elementary Secondary Education

Generalizability and Specificity of Interpretive Arguments: Observations Inspired by the Commentaries

Peer reviewed

Direct link

Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007

In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…

Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement

Intelligence Testing and Cultural Diversity: Concerns, Cautions, and Considerations

Download full text

Ford, Donna Y. – National Research Center on the Gifted and Talented, 2004

With so many unanswered questions and controversies regarding intelligence, testing in general, and testing diverse students in particular, what can educators in gifted education do to ensure that these students have access to and are represented in gifted education programs and services? In this monograph, the author examines test bias by first…

Descriptors: Test Bias, Educational Research, Educational Testing, Standardized Tests

Test and Measurement Expert Opinions: A Dialogue about Testing Students with Disabilities Out of Level in Large-Scale Assessments. Out-of-Level Testing Report.

Download full text

Minnema, Jane; Thurlow, Martha; Bielinski, John – 2002

Two focus groups of test and measurement experts were held to explore the use of out-of-level testing for students with disabilities. The participants (n=17) included state and federal level assessment personnel, test company employees, and university professors. A content analysis of the narrative results indicated that there was no clear…

Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Disabilities

The Proficiency Illusion

Download full text

Cronin, John; Dahlin, Michael; Adkins, Deborah; Kingsbury, G. Gage – Thomas B. Fordham Institute, 2007

At the heart of the No Child Left Behind Act (NCLB) is the call for all students to be "proficient" in reading and mathematics by 2014. Yet the law expects each state to define proficiency as it sees fit and design its own tests. This study investigated three research questions related to this policy: (1) How consistent are various…

Descriptors: Federal Legislation, Mathematics Tests, Test Validity, Reading Tests

Previous Page | Next Page »

Pages: 1 | 2

Bielinski, John	2
Minnema, Jane	2
Thurlow, Martha	2
Adkins, Deborah	1
Ascher, Carol	1
Baldwin, Su G.	1
Clauser, Brian E.	1
Cronin, John	1
Cui, Ying	1
Dahlin, Michael	1
Dillon, Gerard F.	1
Dixon-Román, Ezekiel J.	1
Ford, Donna Y.	1
Gergen, Kenneth J.	1
Hill, Heather C.	1
Kingsbury, G. Gage	1
Leighton, Jacqueline P.	1
Margolis, Melissa J.	1
Mee, Janet	1
Mori, Kazuo	1
Myford, Carol M.	1
Oteng-Ababio, M.	1
Papay, John P.	1
Schilling, Stephen	1
Scott, Jim	1
More ▼