Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Curren, Randall R. – Theory and Research in Education, 2004
This article addresses the capacity of high stakes tests to measure the most significant kinds of learning. It begins by examining a set of philosophical arguments pertaining to construct validity and alleged conceptual obstacles to attributing specific knowledge and skills to learners. The arguments invoke philosophical doctrines of holism and…
Descriptors: Test Items, Educational Testing, Construct Validity, High Stakes Tests
Veldhuis-Diermanse, A. E.; Biemans, H. J. A.; Mulder, M.; Mahdizadeh, H. – Journal of Agricultural Education and Extension, 2006
Networked learning aims to foster students' knowledge construction processes as well as the quality of knowledge construction. In this respect, it is crucial to be able to analyse both aspects of networked learning. Based on theories on networked learning and the empirical work of relevant authors in this domain, two coding schemes are presented…
Descriptors: Land Use, Interrater Reliability, Educational Practices, Learning Processes
Meyen, Edward; Bui, Yvonne N. – Journal of Technology and Teacher Education, 2003
The Online Academy (HO29K73002) was funded by the Office of Special Education Programs (OSEP) to develop research-based online instructional modules in the content areas of reading, positive behavior support and technology across the curriculum. Targeted to preservice teacher education programs in Institutions of Higher Education (IHE), but also…
Descriptors: Teacher Education Programs, Learning Modules, Program Descriptions, Online Systems
Zechner, Klaus; Bejar, Isaac I.; Hemat, Ramin – ETS Research Report Series, 2007
The increasing availability and performance of computer-based testing has prompted more research on the automatic assessment of language and speaking proficiency. In this investigation, we evaluated the feasibility of using an off-the-shelf speech-recognition system for scoring speaking prompts from the LanguEdge field test of 2002. We first…
Descriptors: Role, Computer Assisted Testing, Language Proficiency, Oral Language
Myford, Carol M.; Mislevy, Robert J. – 1995
Establishing and refining a framework for performance assessment is especially difficult in large-scale settings that can involve hundreds of judges and thousands of students. This presentation advocates the interactive use of two complementary analytic perspectives and illustrates the approach in the context of the College Entrance Examination…
Descriptors: Advanced Placement, Art Products, Educational Assessment, Educational Improvement
Hansche, Linda – 1994
Setting standards on performance measures is discussed in the context of the State Collaborative on Assessment and Student Standards (SCASS) initiative supported by the Council of Chief State School Officers. The usual item-based methods for standard setting, the methods developed by Nedelsky (1954), Angoff (1971), and Ebel (1972), were developed…
Descriptors: Decision Making, Educational Assessment, Educational Policy, Elementary Secondary Education
Kemis, Mari R.; And Others – 1990
This study, part of a 10-year longitudinal study assessing the teacher education program at Iowa State University, examined the performance of student teachers and their teaching potential as judged by multiple raters. University supervisors, cooperating teachers, and student teachers (N=260) rated items on a questionnaire addressing professional…
Descriptors: Ability Identification, Comparative Analysis, Cooperating Teachers, Elementary Secondary Education
Halpin, Glennelle; McLean, James E. – 1991
Although the standard-setting method of W. H. Angoff (1971) has broad-based support in the research literature, inconsistencies in the resulting standards do occur. Sources of these inconsistencies are examined in a study of judges, competencies (items), rounds (replications), and the interactions among them. A modified Angoff approach was used to…
Descriptors: Analysis of Variance, Error of Measurement, Evaluators, High Schools
Raymond, Mark R.; Houston, Walter M. – 1990
Performance rating systems frequently use multiple raters in order to improve the reliability of ratings. However, unless all candidates are rated by the same raters, some candidates will be at an unfair advantage or disadvantage solely because they were rated by more stringent or lenient raters. To obtain fair and accurate evaluations of…
Descriptors: Algorithms, Computer Simulation, Educational Assessment, Evaluation Methods
Fiene, Richard; Melnick, Steven A. – 1991
The relationships among independent observer ratings of a child care program on the Early Childhood Environment Rating Scale (ECERS), state department personnel ratings of program quality using the Child Development Program Evaluation Scale (CDPES), and self-evaluation ratings using the self-assessment instrument designed for the Early Childhood…
Descriptors: Accreditation (Institutions), Certification, Child Development Centers, Comparative Testing
Gomez, Joseph J. – 1985
The Management Assessment Center (MAC) of the Dade County (Florida) Public Schools is a unique project, employing multiple techniques to evaluate behavior for school-level administrator personnel selection. This final report of an evaluation deals primarily with issues of the validity and the utility of the MAC. To ascertain validity of the MAC,…
Descriptors: Administrator Evaluation, Administrator Selection, Assessment Centers (Personnel), Correlation
Stolworthy, Reed L. – 1990
This study sought to determine the degree of variance found existing among three different groups of evaluators relative to their assessments of the teaching competencies demonstrated by 60 preservice teacher education students. Data were obtained relative to the undergraduates' self-evaluations regarding the ability to demonstrate 25 teaching…
Descriptors: Analysis of Variance, Cooperating Teachers, Elementary Secondary Education, Higher Education
Auchter, Joan Chikos; Patience, Wayne – 1989
The methods used by the General Educational Development Testing Service (GEDTS) to establish and maintain score stability and reading reliability on its direct assessment of writing are described. Using the 1988 site certification and monitoring results of several scoring sites, the focus is on describing how the score scale was established and…
Descriptors: Decentralization, Equivalency Tests, Essay Tests, Evaluators
Ormrod, Jeanne Ellis; And Others – 1986
This report presents two experiments that compared the performance of map experts to that of novices. Subjects (from the areas of geography, educational psychology, and sociology) were 13 university faculty members in experiment one and 12 undergraduate students in experiment two. Following a practice trial, the learning of a logical map and a…
Descriptors: Analysis of Variance, Cognitive Processes, College Faculty, Content Analysis
Cantor, Nancy K.; Hoover, H. D. – 1986
This paper isolates and examines separately three distinct sources of error in essay scores: lack of agreement between raters, inconsistencies in performance within mode of discourse, and inconsistencies in performance between modes of discourse. Essay prompts in the Iowa Tests of Basic Skills (ITBS) Writing Supplement were designed to assess…
Descriptors: Academic Achievement, Cues, Elementary Secondary Education, Error of Measurement

