Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Bywater, Tracey; Gridley, Nicole; Berry, Vashti; Blower, Sarah; Tobin, Kate – Child Care in Practice, 2019
Background: Group-based parent programmes demonstrate positive benefits for adult and child mental health, and child behaviour outcomes. Greater fidelity to the programme delivery model equates to better outcomes for families attending, however, fidelity is typically self-monitored using programme specific checklists. Self-completed measures are…
Descriptors: Foreign Countries, Parent Education, Check Lists, Psychometrics
Önen, Emine; Yayvak, Melike Kübra Tasdelen – Journal of Education and Training Studies, 2019
In this study, it was aimed to examine the interrater reliability of the scoring of paragraph writing skills on foreign languages with the measurement invariance tests. The study group consists of 267 students studying English at the Preparatory School at Gazi University. In the study, where students write a paragraph on the same topic, the…
Descriptors: Second Language Learning, Second Language Instruction, Factor Analysis, English (Second Language)
Díez-Arcón, Paz – JALT CALL Journal, 2023
Language MOOC research has evolved over the last three years to a more mature stage in which researchers have gained a deeper comprehension of the theories that enable effective language learning in this format. The application of these theoretical advances should be reflected in the instructional design of the courses. This study is based on this…
Descriptors: MOOCs, Second Language Learning, Second Language Instruction, Learning Theories
Lanah Stafford; Erin Cousins; Linda Bol; Megan Mize – Research & Practice in Assessment, 2023
Integrative learning is an important outcome for graduates of higher education. Therefore, it should be well-defined and assessed reliably. The American Association of Colleges & Universities has developed a rubric to define and assess integrative learning, but it has low reliability. This pilot study examines whether this rubric's reliability…
Descriptors: Scoring Rubrics, Reliability, Evaluation Methods, Faculty Development
Karusoo-Musumeci, Ava; Pearce, Wendy M.; Donaghy, Michelle – Child Language Teaching and Therapy, 2022
Oral narrative assessments are important for diagnosis of language disorders in school-age children so scoring needs to be reliable and consistent. This study explored the impact of training on the variability of story grammar scores in children's oral narrative assessments scored by multiple raters. Fifty-one speech pathologists and 19 final-year…
Descriptors: Oral Language, Speech Evaluation, Language Impairments, Elementary School Students
Ashley Marinez – ProQuest LLC, 2020
The purpose of the current study was to determine interrater reliability (IRR) of Oral Reading Fluency (ORF) Curriculum-based Measures (R-CBM) when used with Spanish-speaking English Learner (EL) students. The ORF R-CBM probes obtained from AIMSweb are measures of a student's reading accuracy skills and reading fluency skills. Certified school…
Descriptors: Interrater Reliability, English (Second Language), Reading Fluency, Curriculum Based Assessment
Gorman, Brenda K.; Martinez, Guadalupe; Pina Garcia, Lindsay – Bilingual Research Journal, 2021
The goal of this investigation was to address gaps in the literature regarding dual-language learning in children with Down syndrome (DS). The investigators examined the outcomes of a dual-language narrative intervention and cross-language transfer in a bilingual (Spanish-English) adolescent with DS. Results revealed moderate to large effects in…
Descriptors: Bilingualism, Down Syndrome, Spanish, English (Second Language)
Lyness, Scott A.; Peterson, Kent; Yates, Kenneth – Education Sciences, 2021
The Performance Assessment for California Teachers (PACT) is a high stakes summative assessment that was designed to measure pre-service teacher readiness. We examined the inter-rater reliability (IRR) of trained PACT evaluators who rated 19 candidates. As measured by Cohen's weighted kappa, the overall IRR estimate was 0.17 (poor strength of…
Descriptors: High Stakes Tests, Performance Based Assessment, Teacher Effectiveness, Academic Language
Leaman, Marion C.; Edmonds, Lisa A. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study evaluated interrater reliability (IRR) and test-retest stability (TRTS) of seven linguistic measures (percent correct information units, relevance, subject-verb-[object], complete utterance, grammaticality, referential cohesion, global coherence), and communicative success in unstructured conversation and in a story narrative…
Descriptors: Aphasia, Psychometrics, Correlation, Speech Language Pathology
Finn, Bridgid; Wendler, Cathy; Ricker-Pedley, Kathryn L.; Arslan, Burcu – ETS Research Report Series, 2018
This report investigates whether the time between scoring sessions has an influence on operational and nonoperational scoring accuracy. The study evaluates raters' scoring accuracy on constructed-response essay responses for the "GRE"® General Test. Binomial linear mixed-effect models are presented that evaluate how the effect of various…
Descriptors: Intervals, Scoring, Accuracy, Essay Tests
Gauns Dessai, Kissan G.; Kamat, Venkatesh V. – International Journal of Information and Communication Technology Education, 2018
Educational institutions worldwide conduct summative examinations to evaluate academic performance of students. Such summative examinations are normally subjective in nature in higher education institutions and needs manual evaluation. However, the manual evaluation of subjective answer-scripts often suffers from evaluation anomalies and the…
Descriptors: Computer Assisted Testing, Student Evaluation, Scoring Rubrics, Error Patterns
Klecker, Beverly M. – Online Submission, 2018
The Council for the Accreditation of Educator Preparation Programs (CAEP), required evidence of reliability and validity of measures used in a university's Educator Preparation Program (EPP). This paper describes processes that provided this evidence for the Teacher Performance Assessment (TPA). Literature examined included Messick (1989), Linn…
Descriptors: College Faculty, Teacher Evaluation, Performance Based Assessment, Test Validity
Miles, Anna – International Journal of Language & Communication Disorders, 2017
Background: Oesophageal abnormalities are common findings in a speech-language therapy videofluoroscopy clinic. Fluoroscopic screening involving oropharynx alone fails to identify these patients. Oesophageal screening as an adjunct to videofluoroscopy is gaining popularity. Yet currently, little is known about the reliability of speech and…
Descriptors: Interrater Reliability, Speech Therapy, Allied Health Personnel, Speech Language Pathology
Aragón, Sonia; Lapresa, Daniel; Arana, Javier; Anguera, M. Teresa; Garzón, Belén – Measurement in Physical Education and Exercise Science, 2017
Polar coordinate analysis is a powerful data reduction technique based on the Zsum statistic, which is calculated from adjusted residuals obtained by lag sequential analysis. Its use has been greatly simplified since the addition of a module in the free software program HOISAN for performing the necessary computations and producing…
Descriptors: Physical Activities, Track and Field, Data Analysis, Males
Smith, Grant S.; Paige, David D. – Reading Psychology, 2019
Becoming a fluent reader has been established as important to reading comprehension. Prosody (expression) is an indicator of fluent reading that is linked to improved comprehension in students across elementary, middle, and secondary grades. Fluent reading is most often evaluated by classroom teachers through the use of a rubric, with the most…
Descriptors: Interrater Reliability, Oral Reading, Reading Fluency, National Competency Tests

Peer reviewed
Direct link
