NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
Individuals with Disabilities…1
What Works Clearinghouse Rating
Showing 1 to 15 of 72 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Øydis Hide; Dagrun Slettebø Daltveit; Åse Sivertsen; Anne Katherine Hvistendahl; Randi Lovise Kjerstad; Marit Berntsen Kvinnsland; Nina Helen Pedersen; Christina Sørensen – International Journal of Language & Communication Disorders, 2025
Background: Cleft lip and palate (CLP) treatment in Norway is centralized and multidisciplinary, with long-term follow-up from birth to adulthood. The Norwegian Registry of Cleft Lip and Palate was established to ensure high-quality care and enable systematic data collection. Speech data are a key component, assessed by speech--language therapists…
Descriptors: Foreign Countries, Validity, Reliability, Data Collection
Peer reviewed Peer reviewed
Direct linkDirect link
Pearson, Terry – FORUM: for promoting 3-19 comprehensive education, 2023
Ofsted has frequently defended the judgements made during inspections by claiming that inspection ratings are reliable, as shown by the results from the collection of studies the inspectorate has conducted. I outline the inspectorate's view of reliability and problematise the studies that it has carried out, noting that these provide insufficient…
Descriptors: Inspection, Interrater Reliability, Decision Making, Value Judgment
Peer reviewed Peer reviewed
Direct linkDirect link
Dart, Evan H.; Radley, Keith C. – Psychology in the Schools, 2023
Single-case design is a research methodology that entails repeated measurement to assess the influence of an independent variable on a dependent variable over time. Data collected in this manner are regularly analyzed using visual analysis of data displayed in a linear graph. Although there is agreement regarding critical elements of visual…
Descriptors: Research Design, Research Methodology, Data Collection, Data Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Belur, Jyoti; Tompson, Lisa; Thornton, Amy; Simon, Miranda – Sociological Methods & Research, 2021
A methodologically sound systematic review is characterized by transparency, replicability, and a clear inclusion criterion. However, little attention has been paid to reporting the details of interrater reliability (IRR) when multiple coders are used to make decisions at various points in the screening and data extraction stages of a study. Prior…
Descriptors: Interrater Reliability, Decision Making, Accuracy, Coding
Peer reviewed Peer reviewed
Direct linkDirect link
Taylor, Tessa; Lanovaz, Marc J. – Journal of Applied Behavior Analysis, 2022
Behavior analysts typically rely on visual inspection of single-case experimental designs to make treatment decisions. However, visual inspection is subjective, which has led to the development of supplemental objective methods such as the conservative dual-criteria method. To replicate and extend a study conducted by Wolfe et al. (2018) on the…
Descriptors: Visual Perception, Artificial Intelligence, Decision Making, Evaluators
Peer reviewed Peer reviewed
Direct linkDirect link
João M. Santos – Research Evaluation, 2024
The allocation of scientific funding through grant programs is crucial for research advancement. While independent peer panels typically handle evaluations, their decisions can lean on personal preferences that go beyond the stated criteria, leading to inconsistencies and potential biases. Given these concerns, our study employs a novel method,…
Descriptors: Grants, Program Proposals, Funding Formulas, Scientific Research
Peer reviewed Peer reviewed
Direct linkDirect link
Matthews, Joshua – RELC Journal: A Journal of Language Teaching and Research, 2023
This article explores how the analysis of inter-rater discourse can be used to support collective reflective practice in second language (L2) assessment. To demonstrate, a focused case of the discourse between two experienced language teachers as they negotiate assessment decisions on L2 written texts is presented. Of particular interest was the…
Descriptors: Interrater Reliability, Discourse Analysis, Student Evaluation, Second Language Learning
Goldhaber, Dan; Grout, Cyrus; Wolf, Malcom; Martinkova, Patricia – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2020
There is growing interest in using measures of teacher applicant quality to improve hiring decisions, but the statistical properties of such measures are poorly understood. We present evidence on structured ratings solicited from teacher applicants' references. We find that the reference ratings capture only one underlying dimension of applicant…
Descriptors: Job Applicants, Teacher Selection, Interrater Reliability, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Peer reviewed Peer reviewed
Direct linkDirect link
Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024
This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…
Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Peer reviewed Peer reviewed
Direct linkDirect link
Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023
Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…
Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication
Peer reviewed Peer reviewed
Direct linkDirect link
Robert J. Sternberg; Jenna Landy; Jennifer Long – Roeper Review, 2024
Procedures for identifying the gifted often make use of tests of general intelligence, among other assessments. Robert J. Sternberg recently suggested that identification of the gifted should further involve assessment of what he refers to as adaptive intelligence--the ability to adapt to real-world environments. Such a conception of intelligence…
Descriptors: Intelligence, Intelligence Tests, Gifted, Identification
Peer reviewed Peer reviewed
Direct linkDirect link
Cychosz, Margaret; Cristia, Alejandrina; Bergelson, Elika; Casillas, Marisa; Baudet, Gladys; Warlaumont, Anne S.; Scaff, Camila; Yankowitz, Lisa; Seidl, Amanda – Developmental Science, 2021
This study evaluates whether early vocalizations develop in similar ways in children across diverse cultural contexts. We analyze data from daylong audio recordings of 49 children (1-36 months) from five different language/cultural backgrounds. Citizen scientists annotated these recordings to determine if child vocalizations contained canonical…
Descriptors: Cultural Context, Contrastive Linguistics, Audio Equipment, Cultural Differences
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5