Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Pare, D. E.; Joordens, S. – Journal of Computer Assisted Learning, 2008
As class sizes increase, methods of assessments shift from costly traditional approaches (e.g. expert-graded writing assignments) to more economic and logistically feasible methods (e.g. multiple-choice testing, computer-automated scoring, or peer assessment). While each method of assessment has its merits, it is peer assessment in particular,…
Descriptors: Writing Assignments, Undergraduate Students, Teaching Assistants, Peer Evaluation
Stoddard, Sarah A.; Kubik, Martha Y.; Skay, Carol – Journal of School Nursing, 2008
The Institute of Medicine recommends school-based body mass index (BMI) screening as an obesity prevention strategy. While school nurses have provided height/weight screening for years, little has been published describing measurement reliability or process. This study evaluated the reliability of height/weight measures collected by school nurses…
Descriptors: Obesity, Body Composition, School Nurses, Interrater Reliability
Maguire, Phil; Devereux, Barry; Costello, Fintan; Cater, Arthur – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2007
The competition among relations in nominals (CARIN) theory of conceptual combination (C. L. Gagne & E. J. Shoben, 1997) proposes that people interpret nominal compounds by selecting a relation from a pool of competing alternatives and that relation availability is influenced by the frequency with which relations have been previously associated…
Descriptors: Competition, Program Validation, Item Analysis, Human Relations
McDowell, Brona C.; Kerr, Claire; Parkes, Jackie – Developmental Medicine & Child Neurology, 2007
Gross Motor Function Classification System (GMFCS) level was reported by three independent assessors in a population of children with cerebral palsy (CP) aged between 4 and 18 years (n=184; 112 males, 72 females; mean age 10y 10mo [SD 3y 7mo]). A software algorithm also provided a computed GMFCS level from a regional CP registry. Participants had…
Descriptors: Cerebral Palsy, Parents, Classification, Interrater Reliability
Adams, Jonathan; Eveland, Vicki – Journal of Marketing for Higher Education, 2007
A total of 150 university Web sites were segregated into one of three groups: accredited residential, regionally accredited online, and nonaccredited online institutions. The promotional imagery, marketing messages and marketing themes found on the landing pages of each university program Web sites were analyzed for similarities and differences. A…
Descriptors: Web Sites, Higher Education, Student Recruitment, Classification
White, Alfred H. – American Annals of the Deaf, 2007
The article reports the development of the Structural Analysis of Written Language (SAWL), an instrument designed for use by classroom teachers in objectively documenting the ability of children to write in English. The SAWL allows teachers to use T-unit analysis to quantitatively assess language improvement regardless of whether the student…
Descriptors: Written Language, Evaluation Methods, Printed Materials, Morphemes
Martorell, A.; Pereda, A.; Salvador-Carulla, L.; Ochoa, S.; Ayuso-Mateos, J. L. – Journal of Intellectual Disability Research, 2007
Background: There is little information on the psychometric properties of instruments for assessing family care burden in adults with intellectual disabilities (ID). The aim of this study is therefore to analyse the usefulness of the 'Subjective and Objective Family Burden Interview' (SOFBI) in the assessment of principal caregivers in Spain.…
Descriptors: Foreign Countries, Caregivers, Validity, Factor Analysis
Campbell, Cynthia; Collins, Vicki L. – Educational Measurement: Issues and Practice, 2007
We reviewed the five top-selling introductory assessment textbooks in both general and special education to identify topics contained in textbooks and to determine the extent of agreement among authors regarding the essentialness of topics within and across discipline. Content analysis across the 10 assessment textbooks yielded 73 topics related…
Descriptors: Special Education, Content Analysis, Textbook Evaluation, Textbook Content
Doabler, Christian; Smolkowski, Keith; Fien, Hank; Kosty, Derek B.; Cary, Mari Strand – Society for Research on Educational Effectiveness, 2010
In this paper, the authors report research focused directly on the validation of the Coding of Academic Teacher-Student interactions (CATS) direct observation instrument. They use classroom information gathered by the CATS instrument to better understand the potential mediating variables hypothesized to influence student achievement. Their study's…
Descriptors: Feedback (Response), Curriculum Based Assessment, Observation, Construct Validity
Banda, Sekelani S. – Anatomical Sciences Education, 2009
There are concerns in the literature that the use of case-based teaching of anatomy could be compromising the depth and scope of anatomy learned by students in a problem-based learning curriculum. Poor selection of clinical cases that are used as vehicles for teaching/learning anatomy may be the root problem because some clinical cases do not…
Descriptors: Problem Based Learning, Anatomy, Case Method (Teaching Technique), Evaluation Methods
Oriogun, Peter K. – American Journal of Distance Education, 2009
In this article, the community of inquiry cognitive presence model was mapped to a recently developed semistructured approach to online discourse called SQUAD to detect aspects of critical or higher-order thinking online by cleaning online message transcript through the use of code-recode. It is argued that using code-recode in the way suggested…
Descriptors: Interrater Reliability, Thinking Skills, Transcripts (Written Records), Coding
Peer reviewedWyatt, W. Joseph; And Others – Psychology in the Schools, 1985
Compared traditional percentage and correlational methods of estimating reliability of duration recording to reliability obtained with event-by-event examination of observers' records in which the actual percentage of time that the observers were in agreement was calculated. Traditional percentage reliability scores were found to be significantly…
Descriptors: College Students, Correlation, Higher Education, Interrater Reliability
Peer reviewedHarrington, Robert G.; And Others – Psychology in the Schools, 1985
Evaluated interscorer reliability of the Spatial Memory subtest, which appears on the Simultaneous Processing scale of the Kaufman Assessment Battery for Children. Responses from 19 gifted children were scored by two independent examiners. Results showed this subtest may be prone to scoring errors because no permanent record of responses exists.…
Descriptors: Elementary Education, Gifted, Interrater Reliability, Preadolescents
Peer reviewedWeider-Hatfield, Deborah; Hatfield, John D. – Communication Quarterly, 1984
Evaluation approaches to measuring reliabilty in interaction analysis by (1) presenting criteria for a sound reliability estimate, (2) evaluating currently used tests against these criteria, and (3) discussing application of appropriate tests to interaction data. (PD)
Descriptors: Communication Research, Evaluation Criteria, Interaction Process Analysis, Interrater Reliability
Peer reviewedOrwin, Robert G.; Cordray, David S. – Psychological Bulletin, 1985
Identifies three sources of reporting deficiency for meta-analytic results: quality (adequacy) of publicizing; quality of macrolevel reporting, and quality of microlevel reporting. Reanalysis of 25 reports from the Smith, Glass and Miller (1980) psychotherapy meta-analysis established two sources of misinformation, interrater reliabilities and…
Descriptors: Confidence Testing, Interrater Reliability, Meta Analysis, Psychotherapy

Direct link
