Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Aleong, Chandra – Journal of College Teaching & Learning, 2007
This paper discusses whether there are differences in performance based on differences in strategy. First, an attempt was made to determine whether the institution had a strategy, and if so, did it follow a particular model. Major models of strategy are the industry analysis approach, the resource based view or the RBV model and the more recent,…
Descriptors: Strategic Planning, Higher Education, Institutional Evaluation, Case Studies
Bartels, Meike; Boomsma, Dorret I.; Hudziak, James J.; van Beijsterveldt, Toos C. E. M.; van den Oord, Edwin J. C. G. – Psychological Methods, 2007
Genetically informative data can be used to address fundamental questions concerning the measurement of behavior in children. The authors illustrate this with longitudinal multiple-rater data on internalizing problems in twins. Valid information on the behavior of a child is obtained for behavior that multiple raters agree upon and for…
Descriptors: Twins, Behavior Problems, Genetics, Error of Measurement
Hmelo-Silver, Cindy E.; Marathe, Surabhi; Liu, Lei – Journal of the Learning Sciences, 2007
Understanding complex systems is fundamental to understanding science. The complexity of such systems makes them very difficult to understand because they are composed of multiple interrelated levels that interact in dynamic ways. The goal of this study was to understand how experts and novices differed in their understanding of two complex…
Descriptors: Ecology, Anatomy, Physiology, Knowledge Representation
Olver, Mark E.; Wong, Stephen C. P.; Nicholaichuk, Terry; Gordon, Audrey – Psychological Assessment, 2007
The Violence Risk Scale-Sexual Offender version (VRS-SO) is a rating scale designed to assess risk and predict sexual recidivism, to measure and link treatment changes to sexual recidivism, and to inform the delivery of sexual offender treatment. The VRS-SO comprises 7 static and 17 dynamic items empirically or conceptually linked to sexual…
Descriptors: Validity, Rating Scales, Recidivism, Interrater Reliability
de Villiers, Jessica; Fine, Jonathan; Ginsberg, Gary; Vaccarella, Liezanne; Szatmari, Peter – Journal of Autism and Developmental Disorders, 2007
There are few well-standardized measures of conversational breakdown in Autism Spectrum Disorders (ASD). The study's objective was to develop a scale for measuring pragmatic impairments in conversations of individuals with ASD. We analyzed 46 semi-structured conversations of children and adolescents with high-functioning ASD using a functional…
Descriptors: Measures (Individuals), Speech Communication, Semantics, Pragmatics
Guskey, Thomas R. – Educational Measurement: Issues and Practice, 2007
This study compared different stakeholders' perceived validity of various indicators of student learning used to judge the quality of students' academic performance. Data were gathered from the questionnaire responses of 314 educators in three states that have implemented comprehensive state-wide assessment programs with high-stakes consequences…
Descriptors: Academic Achievement, Educational Indicators, State Surveys, Participation
OECD Publishing (NJ1), 2009
The Organisation for Economic Cooperation and Development's (OECD's) Programme for International Student Assessment (PISA) surveys, which take place every three years, have been designed to collect information about 15-year-old students in participating countries. PISA examines how well students are prepared to meet the challenges of the future,…
Descriptors: Policy Formation, Scaling, Academic Achievement, Interrater Reliability
Horng, Eileen Lai; Klasik, Daniel; Loeb, Susanna – National Center for Analysis of Longitudinal Data in Education Research, 2009
School principals have complex jobs. To better understand the work lives of principals, this study uses observational time-use data for all high school principals in Miami-Dade County Public Schools. This paper examines the relationship between the time principals spent on different types of activities and school outcomes including student…
Descriptors: School Effectiveness, Principals, High Schools, Time Management
Lu, Zhihong; Hou, Leijuan; Huang, Xiaohui – International Journal of Education and Development using Information and Communication Technology, 2010
The development and application of Information and Communication Technologies (ICT) in the field of Foreign Language Teaching (FLT) have had a considerable impact on the teaching methodologies in China. With an increasing emphasis on strengthening students' learning initiative and adopting a "student-centred" teaching concept in FLT,…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Second Language Learning
Katz, Irvin R.; Elliot, Norbert; Attali, Yigal; Scharf, Davida; Powers, Donald; Huey, Heather; Joshi, Kamal; Briller, Vladimir – ETS Research Report Series, 2008
This study presents an investigation of information literacy as defined by the ETS iSkills™ assessment and by the New Jersey Institute of Technology (NJIT) Information Literacy Scale (ILS). As two related but distinct measures, both iSkills and the ILS were used with undergraduate students at NJIT during the spring 2006 semester. Undergraduate…
Descriptors: Information Literacy, Information Skills, Skill Analysis, Case Studies
Garet, Michael S.; Cronen, Stephanie; Eaton, Marian; Kurki, Anja; Ludwig, Meredith; Jones, Wehmah; Uekawa, Kazuaki; Falk, Audrey; Bloom, Howard S.; Doolittle, Fred; Zhu, Pei; Sztejnberg, Laura – National Center for Education Evaluation and Regional Assistance, 2008
To help states and districts make informed decisions about the professional development (PD) they implement to improve reading instruction, the U.S. Department of Education commissioned the Early Reading PD Interventions Study to examine the impact of two research-based PD interventions for reading instruction: (1) a content-focused teacher…
Descriptors: Early Reading, Reading Instruction, Professional Development, Intervention
Stone, Gregory Ethan; Beltyukova, Svetlana; Fox, Christine M. – International Journal of Testing, 2008
Judge-mediated examinations are defined as those for which expert evaluation (using rubrics) is required to determine correctness, completeness, and reasonability of test-taker responses. The use of multifaceted Rasch modeling has led to improvements in the reliability of scoring such examinations. The establishment of criterion-referenced…
Descriptors: Interrater Reliability, High Stakes Tests, Standard Setting, Minimum Competencies
Nadeau, Luc; Richard, Jean-Francois; Godbout, Paul – Physical Education and Sport Pedagogy, 2008
Background: Coaches and physical educators must obtain valid data relating to the contribution of each of their players in order to assess their level of performance in team sport competition. This information must also be collected and used in real game situations to be more valid. Developed initially for a physical education class context, the…
Descriptors: Physical Education, Team Sports, Observation, Performance Based Assessment
Ericsson, K. Anders; Roring, Roy W.; Nandagopal, Kiruthiga – High Ability Studies, 2007
The authors are pleased with commentators' willingness to respond to their target article's challenge to identify observable reproducible phenomena that could be widely accepted as strong scientific evidence for innate talent. In this reply, the authors have organized the ideas in the commentaries into three general categories, namely the…
Descriptors: Interrater Reliability, Reader Response, Rote Learning, Creative Thinking
Dixon, Mark R.; Small, Stacey L.; Rosales, Rocio – Behavior Analyst, 2007
The present paper comments on and extends the citation analysis of verbal operant publications based on Skinner's "Verbal Behavior" (1957) by Dymond, O'Hora, Whelan, and O'Donovan (2006). Variations in population parameters were evaluated for only those studies that Dymond et al. categorized as empirical. Preliminary results indicate that the…
Descriptors: Verbal Communication, Citation Analysis, Verbal Operant Conditioning, Meta Analysis

Peer reviewed
Direct link
