Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Kumar, Vivekanandan S.; Boulanger, David – International Journal of Artificial Intelligence in Education, 2021
This article investigates the feasibility of using automated scoring methods to evaluate the quality of student-written essays. In 2012, Kaggle hosted an Automated Student Assessment Prize contest to find effective solutions to automated testing and grading. This article: a) analyzes the datasets from the contest -- which contained hand-graded…
Descriptors: Automation, Scoring, Essays, Writing Evaluation
Stolpe, Karin; Björklund, Lars; Lundström, Mats; Åström, Maria – Higher Education: The International Journal of Higher Education Research, 2021
Previous research shows a discrepancy between different teachers' assessment of student theses. This might be an even larger problem in the context of teacher education, since teacher trainers originate from different disciplines. This study aims to investigate how different assessors prioritise between criteria for assessment. Criteria were…
Descriptors: Student Evaluation, Theses, Evaluation Criteria, Evaluation Methods
Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023
This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…
Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation
Zhou, Shuqi; Merzdorf, Hillary E.; Douglas, Kerrie A.; Moore, Tamara J. – Journal of Pre-College Engineering Education Research, 2023
This study aimed to develop a K-12 classroom observation protocol to assess K-12 teachers' implementation of science, technology, engineering, and mathematics (STEM) integration. The intended purpose of the observation protocol is for researchers to examine how K-12 teachers implement the STEM integrated curriculum. Based on research on STEM…
Descriptors: Test Construction, Test Validity, STEM Education, Classroom Observation Techniques
Asim, Hafiz Muhmmad; Vaz, Anthony; Mansoori, Shaheen; Ahmed, Ashfaq; Akram, Rizwan; Sadiq, Samreen; Hussain, Haseeb; Aziz, Amer – International Education Studies, 2023
The current research focused on designing a questionnaire on the factors that impact student learning outcomes in the tertiary education system of Pakistan, an underdeveloped nation. A pilot study was conducted to design the questionnaire and collect data on perceived factors that impact student learning outcomes. The selected Higher Education…
Descriptors: Foreign Countries, Outcomes of Education, Higher Education, Student Attitudes
Reflective Minds, Brighter Futures: Empowering Critical Reflection with a Guided Instructional Model
Trixie James; Hayley Griffin; Katrina S. Johnston; Frank Armstrong – Journal of University Teaching and Learning Practice, 2023
Critical thinking is recognised as instrumental for positive, personal and professional, long-term outlooks. It is also widely accepted that the development of students' critical thinking skills can be achieved through explicit interventions. This paper documents the outcomes of a pilot study that investigated the value and impact of an…
Descriptors: Critical Thinking, Reflection, Teaching Methods, Thinking Skills
Huscroft-D'Angelo, Jacqueline; Wery, Jessica; Martin-Gutel, Jodie D.; Pierce, Corey; Loftin, Kara – Assessment for Effective Intervention, 2022
The Scales for Assessing Emotional Disturbance Screener--Third Edition (SAED-3) is a standardized, norm-referenced measure designed to identify school-age students at risk for emotional and behavioral problems. Four studies are reported to address the psychometric status of the SAED-3 Screener. Study 1 examined the internal consistency of the…
Descriptors: Emotional Disturbances, Test Reliability, Test Validity, Screening Tests
Whalen, Kate; Paez, Antonio – Journal of Geography, 2022
Experiential education partnered with guided reflection is thought to support students with higher-order thinking skills. In this study, 44 reflections from two university-level sustainability courses were compared. In both courses students were asked to write a reflection, but only one course used the Reflective Learning Framework (RLF). Tests of…
Descriptors: Geography Instruction, Thinking Skills, Experiential Learning, Sustainability
Venkatraman, Yamini; Mahalingam, Shenbagavalli; Boominathan, Prakash – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument used in voice assessment to assess voice quality. It has been translated and culturally adapted in several languages. This study aimed at developing and validating a Tamil version of CAPE-V through auditory perceptual evaluation of remotely…
Descriptors: Sentences, Dravidian Languages, Acoustics, Auditory Perception
Katherine Drinkwater Gregg; Olivia Ryan; Andrew Katz; Mark Huerta; Susan Sajadi – Journal of Engineering Education, 2025
Background: Courses in engineering often use peer evaluation to monitor teamwork behaviors and team dynamics. The qualitative peer comments written for peer evaluations hold potential as a valuable source of formative feedback for students, yet little is known about their content and quality. Purpose: This study uses a large language model (LLM)…
Descriptors: Artificial Intelligence, Technology Uses in Education, Engineering Education, Student Evaluation
Williams, Logan; Kemp, Simon – Assessment & Evaluation in Higher Education, 2019
We examined the reliability of grading master's theses at a New Zealand university, where a variant of the academic journal review system is employed. The overall correlation between the grades recommended by internal and external markers of master's theses in psychology and applied psychology at this university was 0.39, which is similar to that…
Descriptors: Interrater Reliability, Masters Theses, Foreign Countries, Grades (Scholastic)
Abdalla, Widad – ProQuest LLC, 2019
Trend scoring is often used in large-scale assessments to monitor for rater drift when the same constructed response items are administered in multiple test administrations. In trend scoring, a set of responses from Time "A" are rescored by raters at Time "B." The purpose of this study is to examine the ability of…
Descriptors: Scoring, Interrater Reliability, Test Items, Error Patterns
Franz Holzknecht; Sandrine Tornay; Alessia Battisti; Aaron Olaf Batty; Katja Tissi; Tobias Haug; Sarah Ebling – Language Assessment Quarterly, 2024
Although automated spoken language assessment is rapidly growing, such systems have not been widely developed for signed languages. This study provides validity evidence for an automated web application that was developed to assess and give feedback on handshape and hand movement of L2 learners' Swiss German Sign Language signs. The study shows…
Descriptors: Sign Language, Vocabulary Development, Educational Assessment, Automation
Koutsoftas, Anthony D.; Srivastava, Pradyumn; Harris, Sarah B. – Topics in Language Disorders, 2020
Spelling is an important skill that requires knowledge of phonology, morphology, and orthography, as well as strong visual memory. In this study, we introduce a spelling coding rubric that accounts for different knowledge types needed for spelling and can be used to describe error patterns for both encoding and decoding as part of the writing…
Descriptors: Spelling, Writing Processes, Intermediate Grades, Elementary School Students
Myers, Aaron J.; Ames, Allison J.; Leventhal, Brian C.; Holzman, Madison A. – Applied Measurement in Education, 2020
When rating performance assessments, raters may ascribe different scores for the same performance when rubric application does not align with the intended application of the scoring criteria. Given performance assessment score interpretation assumes raters apply rubrics as rubric developers intended, misalignment between raters' scoring processes…
Descriptors: Scoring Rubrics, Validity, Item Response Theory, Interrater Reliability

