Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Hagaman, Jessica L.; Casey, Kathryn J.; Reid, Robert – Preventing School Failure, 2016
Reading comprehension is important for academic success and is a skill required for many activities in school and beyond. This study investigated the effects of the TRAP (Think before you read, Read a paragraph, Ask myself, "What was this paragraph mostly about?" and Put it into my own words) paraphrasing strategy taught using the…
Descriptors: Middle School Students, Rural Schools, Reading Difficulties, Reading Comprehension
Simin, Cai; Lam, Toh Tin – Journal of Science and Mathematics Education in Southeast Asia, 2016
This paper presents the design and development of a rubric for assessing mathematical modelling for mathematical modelling tasks at the secondary level. A rubric was crafted based on the mathematical modelling competencies synthesised by the researchers identified from four sources. The rubric was fine-tuned following an interview with three…
Descriptors: Scoring Rubrics, Test Construction, Mathematical Models, Secondary School Mathematics
Fuller, Charles Avery – ProQuest LLC, 2016
Beginning with the 2010-2011 school year the North Carolina State Board of Education (SBE) mandated the use of the North Carolina Teacher Evaluation Process (Evaluation Process) for use in all public school systems in the state to conduct teacher observations and evaluations. The Evaluation Process replaced the Teacher Performance Appraisal…
Descriptors: Rural Schools, Principals, Teacher Evaluation, State Standards
Steedle, Jeffrey; LaSalle, Amy – Partnership for Assessment of Readiness for College and Careers, 2016
Partnership for Assessment of Readiness for College and Careers (PARCC) Operational Study 4 Component 3 was designed to compare performance on PARCC mathematics field-test items for grade 3 taken with and without a drawing tool. For the 2016 testing window, five field-test items were selected to have the directions edited to allow students to…
Descriptors: Grade 3, Mathematics Tests, Test Items, Freehand Drawing
Polišenská, Kamila; Kapalková, Svetlana – Journal of Speech, Language, and Hearing Research, 2014
Purpose: A range of nonword repetition (NWR) tasks are used in research and clinical applications, but compliance rates among young children remain low. Live presentation is usually used to improve compliance rates, but this lacks the consistency of recorded stimuli. In this study, the authors examined whether a novel delivery of NWR stimuli based…
Descriptors: Compliance (Psychology), Repetition, Young Children, Age Differences
White, Taylor – Carnegie Foundation for the Advancement of Teaching, 2014
New teacher evaluation systems have emerged as the cornerstone of the recent movement to improve public school teaching. Fueled by incentives from the federal government, state and local policymakers have sought to replace the often-cursory evaluation models of the past with more comprehensive ones. In contrast to past evaluations, which often…
Descriptors: Teacher Evaluation, Student Attitudes, Principals, Observation
Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017
Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…
Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability
Dunlap, Glen; Kern, Lee; dePerczel, Maria; Clarke, Shelley; Wilson, Diane; Childs, Karen E.; White, Ronnie; Falk, George D. – Behavioral Disorders, 2018
Functional assessment and functional analysis are processes that have been applied successfully in work with people who have developmental disabilities, but they have been used rarely with students who experience emotional or behavioral disorders. In the present study, five students in elementary school programs for severe emotional disturbance…
Descriptors: Emotional Disturbances, Behavior Disorders, Developmental Disabilities, Elementary School Students
Warrens, Matthijs J. – Psychometrika, 2012
The quadratically weighted kappa is the most commonly used weighted kappa statistic for summarizing interrater agreement on an ordinal scale. The paper presents several properties of the quadratically weighted kappa that are paradoxical. For agreement tables with an odd number of categories "n" it is shown that if one of the raters uses the same…
Descriptors: Interrater Reliability, Statistics, Measurement
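For readers unfamiliar with the statistic named in the entry above, the following is a minimal sketch of the quadratically weighted kappa under its standard textbook definition, with disagreement weights proportional to the squared distance between categories. It is not code from the cited paper. The function name `quadratic_weighted_kappa` and the example table are hypothetical and chosen only for illustration.

```python
def quadratic_weighted_kappa(table):
    """Quadratically weighted kappa for a square agreement table.

    table[i][j] is the number of subjects placed in category i by
    rater A and in category j by rater B (hypothetical input layout).
    """
    n = len(table)
    if n < 2 or any(len(row) != n for row in table):
        raise ValueError("table must be square with at least two categories")

    total = sum(sum(row) for row in table)
    if total == 0:
        raise ValueError("table contains no ratings")

    # Marginal proportions for each rater.
    row_marg = [sum(row) / total for row in table]
    col_marg = [sum(table[i][j] for i in range(n)) / total for j in range(n)]

    observed = 0.0  # weighted observed disagreement
    expected = 0.0  # weighted disagreement expected under independence
    for i in range(n):
        for j in range(n):
            weight = (i - j) ** 2  # quadratic disagreement weight
            observed += weight * table[i][j] / total
            expected += weight * row_marg[i] * col_marg[j]

    if expected == 0.0:
        # Both raters used a single, identical category: kappa is undefined.
        raise ValueError("kappa is undefined when expected disagreement is zero")

    return 1.0 - observed / expected


if __name__ == "__main__":
    # Illustrative (made-up) ratings on a three-point ordinal scale.
    example = [
        [10, 2, 0],
        [3, 8, 2],
        [0, 1, 9],
    ]
    print(round(quadratic_weighted_kappa(example), 3))
```

The normalising constant (n - 1)^2 that usually appears in the weights cancels between the observed and expected terms, so the sketch omits it. Perfect agreement yields 1, and a table whose cells equal the product of its marginals yields 0.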
Zayac, Ryan M.; Ratkos, Thom; Frieder, Jessica E.; Paulk, Amber – Teaching of Psychology, 2016
Research on teaching has shown that incorporating active student responding (ASR) into classroom instruction facilitates learning and should be considered best practice. Nevertheless, few published studies have examined ASR using a within-participant design across a semester. Using a counterbalanced alternating treatment design, a direct…
Descriptors: Audience Response Systems, Undergraduate Students, Psychology, Comparative Analysis
Choo, Dawn; Dettman, Shani J. – Deafness & Education International, 2016
During the pre- and post-implant habilitation process, mothers of children using cochlear implants may be coached by clinicians to use appropriate communicative strategies during play according to the family's choice of communication approach. The present study compared observations made by experienced and inexperienced individuals in the analysis…
Descriptors: Parent Child Relationship, Mothers, Video Technology, Observation
Kim, Dong-gook; Helms, Marilyn M. – Journal of Education for Business, 2016
Assurance of learning (AoL) processes for continuous improvement and accreditation require business schools to assess program goals. Findings from the process can lead to changes in course design or curriculum. Often AoL assignments are embedded into existing courses and assessed at regular intervals. Faculty members may evaluate an assignment in…
Descriptors: Course Evaluation, Business Administration Education, Business Schools, College Faculty
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
Alegre de la Rosa, Olga María; Villar Angulo, Luis Miguel – Education Sciences, 2019
This study aims to investigate whether emotional and behavioral difficulties (EBD) differ between children with cochlear implants (CIs) or hearing aids (HAs), according to multi-informant ratings. Methods: A battery of psychological measures (e.g., Strengths and Difficulties Questionnaire (SDQ), Illinois Test of Psycholinguistic Abilities (ITPA),…
Descriptors: Behavior Disorders, Emotional Disturbances, Hearing Impairments, Assistive Technology
Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019
Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…
Descriptors: Oral Language, Language Tests, Test Validity, Inferences

