Publication Date
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 48 |
| Since 2017 (last 10 years) | 151 |
| Since 2007 (last 20 years) | 301 |
Descriptor
| Interrater Reliability | 503 |
| Test Reliability | 503 |
| Test Validity | 260 |
| Test Construction | 106 |
| Foreign Countries | 103 |
| Psychometrics | 91 |
| Evaluation Methods | 90 |
| Scores | 67 |
| Correlation | 62 |
| Scoring | 61 |
| Rating Scales | 58 |
| More ▼ | |
Source
Author
| Epstein, Michael H. | 7 |
| Johnson, Evelyn S. | 4 |
| Matson, Johnny L. | 4 |
| Tasse, Marc J. | 4 |
| Aman, Michael G. | 3 |
| Canivez, Gary L. | 3 |
| Capie, William | 3 |
| Conroy, Maureen A. | 3 |
| Crawford, Angela R. | 3 |
| Lecavalier, Luc | 3 |
| McLeod, Bryce D. | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 41 |
| Practitioners | 8 |
| Administrators | 3 |
| Teachers | 3 |
| Counselors | 1 |
Location
| Turkey | 11 |
| Canada | 10 |
| Australia | 9 |
| United Kingdom | 9 |
| Pennsylvania | 7 |
| Florida | 6 |
| Netherlands | 6 |
| Sweden | 5 |
| United Kingdom (England) | 5 |
| China | 4 |
| Illinois | 4 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Olson, Margot A.; Martin, Diane – 1980
In fall 1978, a study was conducted at Eastfield College to determine the desirability of using a writing sample as part of the student assessment and advisement process. At an orientation session attended by 1,643 entering students, an entry-level assessment was conducted, consisting of a writing sample; an objective English composition test; the…
Descriptors: Academic Achievement, Community Colleges, Essay Tests, Holistic Evaluation
Cason, Gerald J.; And Others – 1987
The Objective Test Scoring and Performance Rating (OTS-PR) system is a fully integrated set of 70 modular FORTRAN programs run on a VAX-8530 computer. Even with no knowledge of computers, the user can implement OTS-PR to score multiple-choice tests, include scores from external sources such as hand-scored essays or scores from nationally…
Descriptors: Clinical Experience, Computer Assisted Testing, Educational Assessment, Essay Tests
Peer reviewedMiller, Jeff – College Teaching, 1999
A college faculty member who has graded Advanced Placement exam essays on U.S. government and politics, taken mostly by high school juniors and seniors, suggests that high school teachers and college faculty who assess the essays are not the best qualified persons to do so and that despite efforts to ensure consistency, the resulting scores are…
Descriptors: Advanced Placement, College Instruction, Essays, Evaluation Criteria
Peer reviewedMcLauchlan, William – College Teaching, 1999
A faculty consultant to the Educational Testing Service for advanced placement (AP) test reading in U.S. government and politics responds to an article criticizing essay evaluation methods and criteria, finding in it a fundamental misunderstanding of the AP reading process and explaining why the essays are subject to less scrutiny for style,…
Descriptors: Advanced Placement, College Instruction, Essays, Evaluation Criteria
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring
Scherer, Marcia J.; McKee, Barbara G. – 1992
Validity and reliability data are presented for two instruments for assessing the predispositions that people have toward the use of assistive and educational technologies. The two instruments, the Assistive Technology Device Predisposition Assessment (ATDPA) and the Educational Technology Predisposition Assessment (ETPA), are self-report…
Descriptors: Assistive Devices (for Disabled), Attitude Measures, Check Lists, College Students
Kaplan, Bruce A.; Johnson, Eugene G. – 1992
Across the field of educational assessment the case has been made for alternatives to the multiple-choice item type. Most of the alternative types of items require a subjective evaluation by a rater. The reliability of this subjective rating is a key component of these types of alternative items. In this paper, measures of reliability are…
Descriptors: Educational Assessment, Elementary Secondary Education, Estimation (Mathematics), Evaluators
Aghbar, Ali-Asghar – 1986
The effectiveness of the "read-comp" technique in assessing writing ability and the usefulness of a rubric and procedure devised for scoring read-comp samples and essays were evaluated. Subjects were 100 freshman students enrolled in general and remedial English classes in a 6-week summer session at Indiana University of Pennsylvania.…
Descriptors: College Freshmen, Essay Tests, Evaluation Methods, Grading
Goldstein, Harvey; Wolf, Alison – 1986
Locally developed occupational tests were administered to 16- and 17-year-olds in a government-sponsored vocational education program in the United Kingdom over a six-month period in 1984. Job skills were tested in two occupational areas: use of a micrometer and invoice completion. Some performance tests were designed by researchers and some by…
Descriptors: Comparative Testing, Criterion Referenced Tests, Evaluation Criteria, Foreign Countries
Cronin, Linda; Capie, William – 1986
The influence of day-to-day variation in teacher performance on the reliability and validity of teacher assessment was examined. An attempt was made to identify and quantify sources of score variation attributable to differences in teacher performance, day of observation, observers, and test subscales; and to determine their effects on reliability…
Descriptors: Behavior Change, Behavior Rating Scales, Classroom Observation Techniques, Evaluation Methods
Peer reviewedBradley, Robert H.; Corwyn, Robert F.; Caldwell, Betty M.; Whiteside-Mansell, Leanne; Mink, Iris T. – Journal of Research on Adolescence, 2000
Describes the development of the Early Adolescent version of the Home Observation for Measurement of the Environment (EA-HOME) Inventory. Presents information on its usefulness with African Americans, Chinese Americans, European Americans, Mexican Americans, and Dominican Americans. Notes findings indicating high interobserver agreement, with…
Descriptors: Black Youth, Child Development, Chinese Americans, Cultural Differences
Littlefield, John H.; And Others – 1985
Sixteen Family Practice faculty members completed ratings on 59 senior medical students after a 6-week primary care clerkship. Each student was rated by seven to ten faculty members and the chief residents who worked with them, resulting in a total of 353 ratings. The rating scale covered: (1) attainment of learning objectives; (2) progress during…
Descriptors: Analysis of Variance, Clinical Experience, Confidence Testing, Evaluators
Carlson, Sybil B.; Camp, Roberta – 1985
This paper reports on Educational Testing Service research studies investigating the parameters critical to reliability and validity in both the direct and indirect writing ability assessment of higher education applicants. The studies involved: (1) formulating an operational definition of writing competence; (2) designing and pretesting writing…
Descriptors: College Entrance Examinations, Computer Assisted Testing, English (Second Language), Essay Tests
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 1" evaluates various reading skills of grade one students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. All the correlations were…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability
Center for Innovation in Assessment (NJ1), 2005
Research was conducted to evaluate how well the "Indiana Reading Assessment--Grade 2" evaluates various reading skills of grade two students. Multiple analyses were conducted; while the results of all the analyses were encouraging, the results derived from the concurrent validity study were most significant. Correlations were either…
Descriptors: Reading Tests, Test Validity, Test Reliability, Interrater Reliability


