Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Lartz, M. N.; Litchfield, S. K. – American Annals of the Deaf, 2005
Deaf Education Teacher Preparation Programs must prepare teachers to staff an increasing number of oral programs. A survey was conducted to determine which competencies administrators of deaf education programs rate as important for teachers in oral programs and to compare ratings of these competencies by oral school administrators to ratings made…
Descriptors: Preservice Teachers, Deafness, Surveys, Administrators
Berg, Marie; Jahnsen, Reidun; Froslie, Kathrine Frey; Hussain, Aktahr – Physical & Occupational Therapy in Pediatrics, 2004
Pediatric Evaluation of Disability Inventory (PEDI) is an instrument for evaluating function in children with disabilities aged 6 months to 7.5 years. The PEDI measures both functional performance and capability in three domains: (1) self-care, (2) mobility, and (3) social function. The PEDI has recently been translated into Norwegian. The purpose…
Descriptors: Disabilities, Young Children, Measures (Individuals), Norwegian
Davies, Patricia L.; Soon, Pepper Lee; Young, Michele; Clausen-Yamaki, Amy – Physical & Occupational Therapy in Pediatrics, 2004
This study examined validity of the School Function Assessment (SFA) and interrater reliability of occupational therapist and teacher ratings of students' school function. The validity of the SFA was examined using the known-group method in 35 participants in kindergarten through 7th grade attending elementary schools; 15 students with learning…
Descriptors: Validity, Interrater Reliability, Elementary School Students, Student Evaluation
Dixon, Marlene A.; Cunningham, George B. – Measurement in Physical Education and Exercise Science, 2006
Understanding that the behavior of people takes place within a context, over the past 20 years research in education and the sport sciences has witnessed an increasing development of multilevel frameworks that are both conceptually and methodologically sound. Despite these advances, the use of multilevel models and research designs in education…
Descriptors: Physical Activities, Statistical Data, Statistical Studies, Statistical Analysis
Miller, David; Parker, Donna – Education 3-13, 2006
Although there is a debate about the importance of self-esteem in education, many primary teachers wish to help children who suffer from low self-esteem. However, in order to do this, we first have to identify such children. It is almost taken for granted that we can make quite accurate judgements based on the knowledge built up through day-to-day…
Descriptors: Self Esteem, Teacher Surveys, Teacher Attitudes, Student Surveys
Malone, Margaret E. – Foreign Language Annals, 2003
Since their initial publication in 1982, the ACTFL Guidelines and oral proficiency interview (OPI) have enjoyed widespread use by foreign language educators. They have also been the target of much criticism by researchers of second language acquisition and testing. Much of this criticism has focused on validity claims for the OPI. Other research…
Descriptors: Futures (of Society), Language Tests, Interrater Reliability, Criticism
Conti-Ramsden, Gina; Simkin, Zoe; Pickles, Andrew – Journal of Speech, Language, and Hearing Research, 2006
Purpose: Two approaches commonly used for estimating prevalence of language disorders in families were compared. The 1st involved examining a subset of language items from an investigator-based interview used to record parental information on the language and literacy difficulties in relatives. The 2nd was the direct assessment of ability in…
Descriptors: Comparative Analysis, Parents, Interviews, Incidence
Scahill, Lawrence; McDougle, Christopher J.; Williams, Susan K.; Dimitropoulos, Anastasia; Aman, Michael G.; McCracken, James T.; Tierney, Elaine; Arnold, L. Eugene; Cronin, Pegeen; Grados, Marco; Ghuman, Jaswinder; Koenig, Kathleen; Lam, Kristen S. L.; McGough, James; Posey, David J.; Ritz, Louise; Swiezy, Naomi B.; Vitiello, Benedetto – Journal of the American Academy of Child and Adolescent Psychiatry, 2006
Objective: To examine the psychometric properties of the Children's Yale-Brown Obsessive Compulsive Scales (CYBOCS) modified for pervasive developmental disorders (PDDs). Method: Raters from five Research Units on Pediatric Psychopharmacology (RUPP) Autism Network were trained to reliability. The modified scale (CYBOCS-PDD), which contains only…
Descriptors: Children, Severity (of Disability), Test Reliability, Behavior Disorders
Vendlinski, Terry P.; Nagashima, Sam; Herman, Joan L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
Current educational policy highlights the important role that assessment can play in improving education. State standards and the assessments that are aligned with them establish targets for learning and promote school accountability for helping all students succeed; at the same time, feedback from assessment results is expected to provide …
Descriptors: Elementary School Science, Federal Legislation, State Standards, Educational Improvement
Alderson, J. Charles; And Others – 1995
The guide is intended for teachers who must construct language tests and for other professionals who may need to construct, evaluate, or use the results of language tests. Most examples are drawn from the field of English-as-a-Second-Language instruction in the United Kingdom, but the principles and practices described may be applied to the…
Descriptors: Educational Trends, English (Second Language), Interrater Reliability, Language Tests
Myerberg, N. James – 1996
The Montgomery County (Maryland) public school system has started using assessments other than multiple-choice tests because it is felt that this will provide school staff with better information about the success of the instructional program. One of the ways assessments can provide better information is by having teachers score student papers.…
Descriptors: Accountability, Achievement Tests, Educational Assessment, Elementary Secondary Education
DeMauro, Gerald E. – 1995
Studies of the Angoff method of standard setting suggest that judges agree in their estimates of the relative difficulties of test questions for minimally competent examinees and that each judge's estimates correlate well with the observed item difficulties for examinees whose total test scores are near the judge's personal standard (G. E.…
Descriptors: Ability, Competence, Construct Validity, Difficulty Level
Porter, Don; O'Sullivan, Barry – 1994
A study investigated how perception of the reader's age in relation to the age of the writer affects assessment of writing. Subjects were 26 Japanese women college students of English as a Second Language, all of whom had recently participated in a home-stay program in an English-speaking country. They were given the task of writing brief letters…
Descriptors: Age Differences, Audience Awareness, College Students, English (Second Language)
Henning, Grant; And Others – 1995
A prototype revised form of the Test of Spoken English (TSE) was compared with the current version of the same test, comparing interrater reliability, frequency of rater discrepancy at all score levels, component task adequacy, scoring efficacy, and other concurrent and construct validity evidence, including the oral proficiency interview…
Descriptors: Adults, College Students, Comparative Analysis, English (Second Language)
Rudner, Lawrence M. – 1992
Several common sources of error in assessment that depends on the use of judges are identified, and ways to reduce the impact of rating errors are examined. Numerous threats to the validity of scores based on ratings exist. These threats include: (1) the halo effect; (2) stereotyping; (3) perception differences; (4) leniency/stringency error; and…
Descriptors: Alternative Assessment, Error of Measurement, Evaluation Methods, Evaluators

Peer reviewed
Direct link
