Publication Date
| In 2026 | 0 |
| Since 2025 | 58 |
| Since 2022 (last 5 years) | 284 |
| Since 2017 (last 10 years) | 780 |
| Since 2007 (last 20 years) | 2042 |
Descriptor
| Interrater Reliability | 3124 |
| Foreign Countries | 655 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 25 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Wolfe, Edward W.; Matthews, Staci; Vickers, Daisy – Journal of Technology, Learning, and Assessment, 2010
This study examined the influence of rater training and scoring context on training time, scoring time, qualifying rate, quality of ratings, and rater perceptions. One hundred twenty raters participated in the study and experienced one of three training contexts: (a) online training in a distributed scoring context, (b) online training in a…
Descriptors: Writing Evaluation, Writing Tests, Qualifications, Program Effectiveness
Kapci, Emine Gul; Kucuker, Sevgi; Uslu, Runa I. – Topics in Early Childhood Special Education, 2010
The majority of eligible children cannot access early intervention services in Turkey, often because they are not assessed. The authors adapted the "Ages and Stages Questionnaires" (ASQ) for Turkish children ages 3 to 72 months. Study participants consisted of 375 children who were classified as at risk for developmental delays, 564…
Descriptors: Early Intervention, Eligibility, Classification, Foreign Countries
Joseph, Dana L.; Newman, Daniel A. – Educational and Psychological Measurement, 2010
A major stumbling block for emotional intelligence (EI) research has been the lack of adequate evidence for discriminant validity. In a sample of 280 dyads, self- and peer-reports of EI and Big Five personality traits were used to confirm an a priori four-factor model for the Wong and Law Emotional Intelligence Scale (WLEIS) and a five-factor…
Descriptors: Emotional Intelligence, Measurement Techniques, Validity, Personality Traits
Curcic, Svjetlana; Johnstone, Robin S. – Computers in the Schools, 2016
This study examined the effects of an intervention in writing with digital interactive books. To improve the writing skills of seventh- and eighth-grade students with a learning disability in reading, we conducted a quasi-experimental study in which the students read interactive digital books (i-books), took notes, wrote summaries, and acted as…
Descriptors: Intervention, Writing Skills, Learning Disabilities, Cartoons
Mlodinow, Leonard – Chronicle of Higher Education, 2008
In this article, the author talks about the release of the most comprehensive study of SAT exams. The headline on the Web site of the College Board, the maker of the test, was, "SAT Studies Show Test's Strength in Predicting College Success." At the same time, a headline on the Web site of the group FairTest, a 23-year-old, nonprofit…
Descriptors: Writing Tests, Academic Achievement, Grading, Standardized Tests
Boisvert, Michelle K. – ProQuest LLC, 2012
There is a national shortage of school-based Speech Language Pathologists (SLP). Schools located in rural and geographically remote areas are often impacted by the shortage, and as a result students with an autism spectrum disorder may not receive services that are mandated by their Individual Education Plan. This study examined the use of…
Descriptors: Investigations, Speech Language Pathology, Intervention, Autism
Clarkeburn, Henriikka; Kettula, Kirsi – Teaching in Higher Education, 2012
This study looks at the fairness of assessing learning journals both as the fairness in creating a valid and robust marking process as well as how different student groups may have unfair disadvantages in performing well in reflective assessment tasks. The fairness of a marking process is discussed through reflecting on the practical process and…
Descriptors: Student Evaluation, Reflection, Summative Evaluation, Formative Evaluation
Leung, Kai-Kuen; Wang, Wei-Dan; Chen, Yen-Yuan – Advances in Health Sciences Education, 2012
There is a lack of information on the use of multi-source evaluation to assess trainees' interpersonal and communication skills in Oriental settings. This study is conducted to assess the reliability and applicability of assessing the interpersonal and communication skills of family medicine residents by patients, peer residents, nurses, and…
Descriptors: Foreign Countries, Clinical Teaching (Health Professions), Communication Skills, Patients
Ferster, Bill; Hammond, Thomas C.; Alexander, R. Curby; Lyman, Hunt – Journal of Interactive Learning Research, 2012
The hurried pace of the modern classroom does not permit formative feedback on writing assignments at the frequency or quality recommended by the research literature. One solution for increasing individual feedback to students is to incorporate some form of computer-generated assessment. This study explores the use of automated assessment of…
Descriptors: Feedback (Response), Scripts, Formative Evaluation, Essays
Koder, Deborah-Anne; Klahr, Amanda – Educational Gerontology, 2010
The Mini-Mental State Examination (MMSE) is one of the most commonly used instruments to screen for cognitive deficits within the hospital setting. However training in how to administer this widely used tool is scarce with little, if any, formal training for nursing staff. Scores are also often misused with over reliance on results and cut-offs to…
Descriptors: Nursing Education, Nurses, Dementia, Knowledge Level
Park, Juyoung; Castellanos-Brown, Karen; Belcher, John – Research on Social Work Practice, 2010
Objective: Pain assessment for nonverbal older adults with cognitive impairments or dementia presents many challenges, and it is important to determine which scales are most useful in assessing pain among this population. Method: In this review 11 observational scales for assessment of pain in older adults with dementia or cognitive impairments…
Descriptors: Pain, Dementia, Older Adults, Psychometrics
Gustafsson, Peik; Svedin, Carl Goran; Ericsson, Ingegerd; Linden, Christian; Karlsson, Magnus K.; Thernlund, Gunilla – Developmental Medicine & Child Neurology, 2010
Aim: To study the value and reliability of an examination of neurological soft-signs, often used in Sweden, in the assessment of children with attention-deficit-hyperactivity disorder (ADHD), by examining children with and without ADHD, as diagnosed by an experienced clinician using the DSM-III-R. Method: We have examined interrater reliability…
Descriptors: Physical Education, Attention Deficit Hyperactivity Disorder, Children, Test Validity
Alt, Mary; Meyers, Christina; Figueroa, Cecilia – Journal of Speech, Language, and Hearing Research, 2013
Purpose: The purpose of this study was to determine whether children exposed to 2 languages would benefit from the phonotactic probability cues of a single language in the same way as monolingual peers and to determine whether crosslinguistic influence would be present in a fast-mapping task. Method: Two groups of typically developing children…
Descriptors: Regression (Statistics), Spanish, Cues, Task Analysis
Wang, Ping – English Language Teaching, 2009
This paper makes a study of the rater reliability in scoring composition in the test of English as a foreign language (EFL) and focuses on the inter-rater reliability as well as several interactions between raters and the other facets involved (that is examinees, rating criteria and rating methods). Results showed that raters were fairly…
Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)
Jaberg, Peter E.; Dixon, David J.; Weis, Glenna M. – Canadian Journal of School Psychology, 2009
The Devereux Early Childhood Assessment (DECA) was developed to assess the social-emotional functioning of preschool children. The developers of the DECA report initial validity and reliability evidence in support of the use of the instrument with 2- to 5-year-old children across the United States. There is further need to collect independent…
Descriptors: Validity, Test Reliability, Interrater Reliability, Factor Structure

Peer reviewed
Direct link
