Goldstein, Harvey – British Educational Research Journal, 2015
A response is made to a paper that urges the use of the Rasch model for educational assessment. This paper argues that the model is inadequate and that claims for its efficacy are exaggerated and technically weak.
Descriptors: Reader Response, Item Response Theory, Educational Assessment, Evaluation Methods
Bertenthal, Bennett I.; Scheutz, Matthias – Cognitive Science, 2013
Cooper et al. (this issue) develop an interactive activation model of spatial and imitative compatibilities that simulates the key results from Catmur and Heyes (2011) and thus conclude that both compatibilities are mediated by the same processes since their single model can predict all the results. Although the model is impressive, the…
Descriptors: Models, Test Validity, Test Reliability, Reader Response
Gargani, John; Strong, Michael – Journal of Teacher Education, 2015
In Gargani and Strong (2014), we describe The Rapid Assessment of Teacher Effectiveness (RATE), a new teacher evaluation instrument. Our account of the validation research associated with RATE inspired a review by Good and Lavigne (2015). Here, we reply to the main points of their review. We elaborate on the validity, reliability, theoretical…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Evaluation Methods
Pane, John F.; Griffin, Beth Ann; McCaffrey, Daniel F.; Karam, Rita – RAND Corporation, 2014
This addendum to previously published results presents alternative analyses of data from large-scale effectiveness studies of Cognitive Tutor Algebra I in middle schools and high schools. These alternative analyses produce results that are substantively the same as previously reported. We find a significant positive effect of 0.21 standard…
Descriptors: Algebra, Statistical Significance, Pretests Posttests, Reader Response
Williams, Matt N.; Gomez Grajales, Carlos Alberto; Kurkiewicz, Dason – Practical Assessment, Research & Evaluation, 2013
In 2002, an article entitled "Four assumptions of multiple regression that researchers should always test" by Osborne and Waters was published in "PARE." This article has gone on to be viewed more than 275,000 times (as of August 2013), and it is one of the first results displayed in a Google search for "regression…
Descriptors: Multiple Regression Analysis, Misconceptions, Reader Response, Predictor Variables
Williams, Lunetta M.; Hall, Katrina W.; Hedrick, Wanda B.; Lamkin, Marcia; Abendroth, Jennifer – Journal of Language and Literacy Education, 2013
The purpose of the present study was to develop an instrument to measure reading during in-school independent reading (ISIR). Procedures to establish validity and reliability of the instrument included videotaping and observing students during ISIR, gathering feedback from literacy experts, establishing interrater reliability, crosschecking…
Descriptors: Test Construction, Test Validity, Test Reliability, Video Technology
Phelps, Richard P. – Online Submission, 2005
John J. Cannell's late 1980s "Lake Wobegon" reports suggested widespread deliberate educator manipulation of norm-referenced standardized test (NRT) administrations and results, resulting in artificial test score gains. The Cannell studies have been referenced in education research since, but as evidence that high stakes (and not cheating or lax…
Descriptors: Testing Programs, Achievement Gains, Standardized Tests, Norm Referenced Tests
Fuller, Deborah Ann – 1987
A case study examined the effects of primary trait scoring on evaluating seventh grade students' compositions. Primary trait scoring as used in the Metropolitan School District of Washington Township in Indianapolis is innovative in that (1) the scoring guide is assignment specific, written after adult raters read and discussed many papers; (2)…
Descriptors: Case Studies, Curriculum Evaluation, Elementary Education, Evaluation Criteria

