Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Watson, Mary Katherine; Pelkey, Joshua; Noyes, Caroline R.; Rodgers, Michael O. – Journal of Engineering Education, 2016
Background: Conceptual understanding is a prerequisite for engineering competence. Concept maps may be effective tools for assessing conceptual knowledge, yet further work is needed to examine scoring methods. Purpose: Our purpose was to evaluate the efficacy of three concept map scoring methods. Traditional scoring requires judges to count…
Descriptors: Concept Mapping, Concept Formation, Scoring Rubrics, Undergraduate Students
Kuo, Bor-Chen; Chen, Chun-Hua; Yang, Chih-Wei; Mok, Magdalena Mo Ching – Educational Psychology, 2016
Traditionally, teachers evaluate students' abilities via their total test scores. Recently, cognitive diagnostic models (CDMs) have begun to provide information about the presence or absence of students' skills or misconceptions. Nevertheless, CDMs are typically applied to tests with multiple-choice (MC) items, which provide less diagnostic…
Descriptors: Multiple Choice Tests, Responses, Test Items, Models
Dowds, Susan J. Parault; Haverback, Heather Rogers; Parkinson, Meghan M. – Journal of Experimental Education, 2016
This study aimed to determine which types of context clues exist in children's texts and whether it is possible for experts to identify reliably those clues. Three experienced coders used Ames' clue set as a foundation for a system to classify context clues in children's text. Findings showed that the adjustments to Ames' system resulted in 15…
Descriptors: Childrens Literature, Cues, Classification, Coding
Chang, Briana L.; Cromley, Jennifer G.; Tran, Nhi – International Journal of Science and Mathematics Education, 2016
Coordination of multiple representations (CMR) is widely recognized as a critical skill in mathematics and is frequently demanded in reform calculus textbooks. However, little is known about the prevalence of coordination tasks in such textbooks. We coded 707 instances of CMR in a widely used reform calculus textbook and analyzed the distributions…
Descriptors: Calculus, Textbooks, Teaching Methods, Mathematics Instruction
Çetrez-Iscan, Galibiye; Nurçin, Elçin; Fazlioglu, Yesim – Educational Research and Reviews, 2016
Dressing is one of the necessary self-care skills taught to individuals with autism so that they can live independently. Typically developing individuals can acquire dressing skills on their own; however, children with autism have difficulty learning such skills without systematic teaching. Thus, teaching dressing…
Descriptors: Autism, Measurement Techniques, Cues, Check Lists
Park, Yoon Soo; Hyderi, Abbas; Bordage, Georges; Xing, Kuan; Yudkowsky, Rachel – Advances in Health Sciences Education, 2016
Recent changes to the patient note (PN) format of the United States Medical Licensing Examination have challenged medical schools to improve the instruction and assessment of students taking the Step-2 clinical skills examination. The purpose of this study was to gather validity evidence regarding response process and internal structure, focusing…
Descriptors: Interrater Reliability, Generalizability Theory, Licensing Examinations (Professions), Physicians
Dekker, Vera; Nauta, Maaike H.; Mulder, Erik J.; Sytema, Sjoerd; de Bildt, Annelies – Journal of Autism and Developmental Disorders, 2016
The Social skills Observation Measure (SOM) is a direct observation method for social skills used in naturalistic everyday situations in school. This study describes the development of the SOM and investigates its psychometric properties in 86 children with Autism spectrum disorder, aged 9.8-13.1 years. The interrater reliability was found to be…
Descriptors: Interpersonal Competence, Autism, Pervasive Developmental Disorders, Naturalistic Observation
Kelcey, Ben; Wang, Shanshan; Cox, Kyle – Society for Research on Educational Effectiveness, 2016
Valid and reliable measurement of unobserved latent variables is essential to understanding and improving education. A common and persistent approach to assessing latent constructs in education is the use of rater inferential judgment. The purpose of this study is to develop high-dimensional explanatory random item effects models designed for…
Descriptors: Test Items, Models, Evaluators, Longitudinal Studies
Schultz, Sarah M.; Jacobs, Michelle M.; Gorgos, Kara S.; Wasylyk, Nicole T.; Hanrahan, Sean; Van Lunen, Bonnie L. – Athletic Training Education Journal, 2015
Context: Accuracy of locating various lumbopelvic landmarks for novice athletic trainers has not been examined. Objective: To examine reliability of novice athletic trainers for identification of the L4 spinous process and right and left posterior superior iliac spine (PSIS). Design: Cross-sectional reliability. Setting: Laboratory. Patients or…
Descriptors: Athletics, Allied Health Personnel, Entry Workers, Reliability
Wang, Yanqing; Ai, Wenguo; Liang, Yaowen; Liu, Ying – Journal of Educational Computing Research, 2015
Peer assessment is an efficient and effective learning assessment method that has been used widely in diverse fields in higher education. Despite its many benefits, a fundamental problem in peer assessment is that participants lack the motivation to assess others' work faithfully and fairly. Nonconsensus is a common challenge that makes the…
Descriptors: Peer Evaluation, Student Motivation, Programming Languages, Computer Science Education
Spatar, Ciprian; Penna, Nigel; Mills, Henny; Kutija, Vedrana; Cooke, Martin – Assessment & Evaluation in Higher Education, 2015
Group work can form a substantial component of degree programme assessments. To satisfy institutional and student expectations, students must often be assigned individual marks for their contributions to the group project, typically by mapping a single holistic group mark to individual marks using peer assessment scores. Since the early 1990s,…
Descriptors: Peer Evaluation, Group Activities, Grades (Scholastic), Methods
Meadows, Michelle; Caniglia, Joanne – Science Educator, 2018
Science Olympiad (SO) is a national non-profit organization which holds science competitions for students in grades 7-12 within 50 states with each event aligned to Next Generation Science Standards (NGSS, 2010). The purpose of this article is not only to align the Common Core State Standards in Mathematics (CCSSM) with Science Olympiad events,…
Descriptors: Competition, Science Instruction, Content Analysis, Middle School Students
van Batenburg, Eline S. L.; Oostdam, Ron J.; van Gelderen, Amos J. S.; de Jong, Nivja H. – Language Testing, 2018
This article explores ways to assess interactional performance, and reports on the use of a test format that standardizes the interlocutor's linguistic and interactional contributions to the exchange. It describes the construction and administration of six scripted speech tasks (instruction, advice, and sales tasks) with pre-vocational learners (n…
Descriptors: Second Language Learning, Speech Tests, Interaction, Test Reliability
Kim, Yanghee Anna; An, Sohyun; Kim, Hyun Chu Leah; Kim, Jihye – Journal of Educational Research, 2018
The authors' goal was to identify ways in which Korean immigrant parents define the concept of parental involvement and to examine the statistical significances of interrelationships among these meanings. Seventy-seven parents responded to an open-ended question that asked them to define the meaning of parental involvement; 141 responses were…
Descriptors: Immigrants, Korean Americans, Parent Participation, Correlation
Comparison of Prompting Procedures to Teach Work Tasks to Transition-Aged Students with Disabilities
Riesen, Tim; Jameson, J. Matt – Education and Training in Autism and Developmental Disabilities, 2018
A single subject alternating treatment design was used to compare most-to-least and least-to-most prompts to teach work tasks in community businesses. Four students with moderate to severe disabilities, two paraprofessionals, and one transition teacher participated in the study. Results of the study suggest that both prompting strategies were…
Descriptors: Comparative Analysis, Severe Disabilities, Disabilities, Paraprofessional Personnel

