Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Sincar, Mehmet – Educational Sciences: Theory and Practice, 2013
The purpose of this study is to determine the challenges school principals facing in the context of technology leadership. This is a qualitative case study guided by the National Educational Technology Standards for Administrators (NETS*A). Six elementary school principals working in a large city in southeastern Turkey participated into the study.…
Descriptors: Foreign Countries, Principals, Semi Structured Interviews, Validity
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Holifield-Scott, April – ProQuest LLC, 2011
A study was conducted to determine the extent to which high school and college/university Advanced Placement English Language and Composition readers value and implement the curricular requirements of Advanced Placement English Language and Composition. The participants were 158 readers of the 2010 Advanced Placement English Language and…
Descriptors: Advanced Placement, English Instruction, Writing (Composition), English Curriculum
Eisenkraft, Arthur; Eisenkraft, Noah – Journal of College Science Teaching, 2011
To find out whether the education community shares a collective understanding about how students should be evaluated, we surveyed 202 educators (from all grade levels) and scientists attending assessment workshops (Pennsylvania, California, and Massachusetts) or judging a national student competition (Washington, DC). The educators and scientists…
Descriptors: Student Evaluation, Scientists, Grades (Scholastic), Grading
Ghirardelli, Alyssa; Quinn, Valerie; Sugerman, Sharon – Journal of Nutrition Education and Behavior, 2011
Objective: To develop a retail grocery instrument with weighted scoring to be used as an indicator of the food environment. Participants/Setting: Twenty six retail food stores in low-income areas in California. Intervention: Observational. Main Outcome Measure(s): Inter-rater reliability for grocery store survey instrument. Description of store…
Descriptors: Interrater Reliability, Marketing, Scoring, Correlation
Shirazi, Mandana; Sadeghi, Majid; Emami, A.; Kashani, A. Sabouri; Parikh, Sagar; Alaeddini, F.; Arbabi, Mohammad; Wahlstrom, Rolf – Academic Psychiatry, 2011
Objective: Standardized patients (SPs) have been developed to measure practitioner performance in actual practice settings, but results have not been fully validated for psychiatric disorders. This study describes the process of creating reliable and valid SPs for unannounced assessment of general-practitioners' management of depression disorders…
Descriptors: Medical Education, Acting, Patients, Role Playing
Jefferies, Ann; Simmons, Brian; Ng, Eugene; Skidmore, Martin – Advances in Health Sciences Education, 2011
Competency based medical education involves assessing physicians-in-training in multiple roles. Training programs are challenged by the need to introduce appropriate yet feasible assessment methods. We therefore examined the utility of a structured oral examination (SOE) in the assessment of the 7 CanMEDS roles (Medical Expert, Communicator,…
Descriptors: Medical Education, Competence, Medical Students, Student Evaluation
Sadler, D. Royce – Quality in Higher Education, 2011
The tension between the freedom of academics to grade the achievements of their students without interference or coercion and the prerogative of higher education institutions to control grading standards is often deliberated by weighing up the authority and rights of the two parties. An alternative approach is to start with an analysis of the…
Descriptors: Higher Education, Academic Freedom, Academic Achievement, Academic Standards
Jitendra, Asha K.; Burgess, Clare; Gajria, Meenakshi – Exceptional Children, 2011
Educators have widely used cognitive strategy instruction to address reading comprehension deficits evidenced by students with learning disabilities. However, no one has yet conducted a review of the quality of this literature. This review applies the quality indicators advocated by Gersten et al. (2005) and Horner et al. (2005) to evaluate the…
Descriptors: Reading Comprehension, Learning Disabilities, Effect Size, Cognitive Processes
Alhaisoni, Eid – English Language Teaching, 2012
This study investigates the writing revision strategies used by 16 Saudi English as foreign language (EFL) students. Two research methods were employed. First, think-aloud reporting was used to gain insight into the thought processes utilized by the students, and to study the revision strategies that Saudi male university students make use of…
Descriptors: Foreign Countries, Protocol Analysis, English (Second Language), Second Language Instruction
Chang, Chi-Cheng; Wu, Bing-Hong – Educational Technology & Society, 2012
This study explored the reliability and validity of teacher assessment under a Web-based portfolio assessment environment (or Web-based teacher portfolio assessment). Participants were 72 eleventh graders taking the "Computer Application" course. The students perform portfolio creation, inspection, self- and peer-assessment using the Web-based…
Descriptors: Computer Oriented Programs, Validity, Portfolio Assessment, Internet
Barnett, Lisa; van Beurden, Eric; Morgan, Philip J.; Lincoln, Doug; Zask, Avigdor; Beard, John – Research Quarterly for Exercise and Sport, 2009
An important aspect in studies concerning fundamental motor skills (FMS) proficiency is interrater objectivity (or interrater reliability), defined as the consistency or agreement in scores obtained from two or more raters. In a training setting, interrater objectivity is commonly determined as the relative number of times raters agree with an…
Descriptors: Physical Activities, Observation, Interrater Reliability, Psychomotor Skills
Shriberg, Lawrence D.; Fourakis, Marios; Hall, Sheryl D.; Karlsson, Heather B.; Lohmeier, Heather L.; McSweeny, Jane L.; Potter, Nancy L.; Scheer-Cohen, Alison R.; Strand, Edythe A.; Tilkens, Christie M.; Wilson, David L. – Clinical Linguistics & Phonetics, 2010
A companion paper describes three extensions to a classification system for paediatric speech sound disorders termed the Speech Disorders Classification System (SDCS). The SDCS uses perceptual and acoustic data reduction methods to obtain information on a speaker's speech, prosody, and voice. The present paper provides reliability estimates for…
Descriptors: Phonemes, Phonetic Transcription, Reliability, Classification
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability
Gunlicks-Stoessel, Meredith; Mufson, Laura; Jekal, Angela; Turner, J. Blake – Journal of Consulting and Clinical Psychology, 2010
Objective: Aspects of depressed adolescents' perceived interpersonal functioning were examined as moderators of response to treatment among adolescents treated with interpersonal psychotherapy for depressed adolescents (IPT-A; Mufson, Dorta, Moreau, & Weissman, 2004) or treatment as usual (TAU) in school-based health clinics. Method:…
Descriptors: Low Income, Mothers, Conflict, Rating Scales

Peer reviewed
Direct link
