Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Educational Testing | 16 |
| Evaluation Problems | 16 |
| Testing Problems | 16 |
| Evaluation Methods | 10 |
| Student Evaluation | 9 |
| Educational Assessment | 7 |
| Psychometrics | 7 |
| Measurement | 6 |
| Test Validity | 6 |
| Evaluation Research | 5 |
| Educational Policy | 4 |
| More ▼ | |
Source
Author
| Bielinski, John | 2 |
| Minnema, Jane | 2 |
| Thurlow, Martha | 2 |
| Adkins, Deborah | 1 |
| Ascher, Carol | 1 |
| Baldwin, Su G. | 1 |
| Clauser, Brian E. | 1 |
| Cronin, John | 1 |
| Cui, Ying | 1 |
| Dahlin, Michael | 1 |
| Dillon, Gerard F. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 10 |
| Reports - Research | 6 |
| Reports - Evaluative | 4 |
| Opinion Papers | 3 |
| Reports - Descriptive | 2 |
| ERIC Digests in Full Text | 1 |
| ERIC Publications | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Elementary Secondary Education | 7 |
| Adult Education | 1 |
| Elementary Education | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| Higher Education | 1 |
| Junior High Schools | 1 |
| Postsecondary Education | 1 |
| Secondary Education | 1 |
Audience
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Advanced Placement… | 1 |
| Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Mori, Kazuo; Uchida, Akitoshi – Research in Education, 2012
Longitudinal change in the average Z scores for four groups of pupils sorted by quartiles was examined for its stability over three years. The data, collected from 1998 to 2009, was obtained from nine cohorts of Japanese junior high school pupils totaling 1,962 subjects. It showed illusionary declines among the mid-range pupils but improvements…
Descriptors: Foreign Countries, Junior High School Students, Cohort Analysis, Evaluation Problems
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests
Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009
In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…
Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Oteng-Ababio, M. – Turkish Online Journal of Distance Education, 2011
Distance Education has globally become one of the important solutions for increasing admission into the universities, decongesting campuses and efficient utilization of time and space. To ensure the sustainability of the programmes' noble objectives calls for periodic re-evaluation of its modus operandi including the assessment of the perception…
Descriptors: Foreign Countries, Student Attitudes, Distance Education, Negative Attitudes
Peer reviewedSizmur, Steve – British Educational Research Journal, 1997
Examines the appropriateness of a cut-off score derived from the Angoff procedure for a reading test in the United Kingdom. Shows that the recommended cut-off score is too low. Suggests ways that standard setting might draw on a range of information to produce appropriate and rationally defensible cut-off scores. (DSK)
Descriptors: Academic Achievement, British National Curriculum, Educational Testing, Elementary Secondary Education
Zalon, Margarete Lieb – 1991
Certain types of evaluation tools used in continuing education offerings limit the evaluation process and hence the value of the offering. Asking a yes/no question as an exclusive evaluation method for learners' achievement of objectives is inappropriate for certain types of offerings. Provision should be made for assessment of continuing…
Descriptors: Academic Achievement, Adult Learning, Adult Students, Continuing Education
Hill, Heather C. – Measurement: Interdisciplinary Research and Perspectives, 2007
The author offers some thoughts on commentator's reactions to the substance of the measures, particularly those about measuring teacher learning and change, based on the major uses of the measures, and because this is a significant challenge facing test development as an enterprise. If teacher learning results in more integrated knowledge or…
Descriptors: Educational Testing, Tests, Measurement, Faculty Development
Ascher, Carol – 1990
Performance-based assessment has the potential to support a richer curriculum and more accurately assess the skills of low-income minority students than standardized tests. Performance-based assessment has the following advantages: (1) it allows a wide range of expression; (2) it permits assessment of learning in a natural context while students…
Descriptors: Disadvantaged Youth, Educational Innovation, Educational Testing, Elementary Secondary Education
Schilling, Stephen – Measurement: Interdisciplinary Research and Perspectives, 2007
In this article, the author echoes his co-author's and colleague's pleasure (Hill, this issue) at the thoughtfulness and far-ranging nature of the comments to their initial attempts at test validation for the mathematical knowledge for teaching (MKT) measures using the validity argument approach. Because of the large number of commentaries they…
Descriptors: Generalizability Theory, Persuasive Discourse, Educational Testing, Measurement
Ford, Donna Y. – National Research Center on the Gifted and Talented, 2004
With so many unanswered questions and controversies regarding intelligence, testing in general, and testing diverse students in particular, what can educators in gifted education do to ensure that these students have access to and are represented in gifted education programs and services? In this monograph, the author examines test bias by first…
Descriptors: Test Bias, Educational Research, Educational Testing, Standardized Tests
Minnema, Jane; Thurlow, Martha; Bielinski, John – 2002
Two focus groups of test and measurement experts were held to explore the use of out-of-level testing for students with disabilities. The participants (n=17) included state and federal level assessment personnel, test company employees, and university professors. A content analysis of the narrative results indicated that there was no clear…
Descriptors: Academic Standards, Adaptive Testing, Criterion Referenced Tests, Disabilities
Cronin, John; Dahlin, Michael; Adkins, Deborah; Kingsbury, G. Gage – Thomas B. Fordham Institute, 2007
At the heart of the No Child Left Behind Act (NCLB) is the call for all students to be "proficient" in reading and mathematics by 2014. Yet the law expects each state to define proficiency as it sees fit and design its own tests. This study investigated three research questions related to this policy: (1) How consistent are various…
Descriptors: Federal Legislation, Mathematics Tests, Test Validity, Reading Tests
Previous Page | Next Page »
Pages: 1 | 2
Direct link
