Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Cecchetti, Alfred A. – ProQuest LLC, 2009
Objective: This dissertation developed an automatic classification procedure, as an example of a novel tool for an informationist, which extracts information from published abstracts, classifies abstracts into their "fields of study," and then determines the researcher's "field of study" and "level of activity." …
Descriptors: Medical Research, Medical Schools, Medicine, Classification
Education Resource Strategies, 2009
Resources matter. How well schools and districts use their people, time, and money is often even more important than how much they receive. Education Resource Strategies' extensive research with districts and schools shows that despite differences in school level, size, location, student population, or even instructional focus, high-performing…
Descriptors: Educational Resources, Institutional Characteristics, Differences, Effective Schools Research
Burke, Mack D.; Vannest, Kimberly; Davis, John; Davis, Cole; Parker, Richard – Behavioral Disorders, 2009
This study is a preliminary examination of the reliability of frequent retrospective teacher behavior ratings. Frequent retrospective behavior ratings are an approach for creating scales that can be used to monitor individual behavioral progress. In this study, the approach is used to progress monitor behavioral individualized education plan goals…
Descriptors: Elementary School Students, Teacher Behavior, Student Behavior, Topography
Tanaka, Koji – Educational Studies in Japan: International Yearbook, 2009
The recent "Nationwide academic achievement and study situation survey" was clearly influenced by the idea of "authentic assessment", an educational assessment perspective focused on "quality" and "engagement". However, when "performance assessment", the assessment method corresponding to this…
Descriptors: Educational Assessment, Performance Based Assessment, Academic Achievement, Educational Research
Young, John W. – Educational Assessment, 2009
In this article, I specify a conceptual framework for test validity research on content assessments taken by English language learners (ELLs) in U.S. schools in grades K-12. This framework is modeled after one previously delineated by Willingham et al. (1988), which was developed to guide research on students with disabilities. In this framework…
Descriptors: Test Validity, Evaluation Research, Achievement Tests, Elementary Secondary Education
Daniel, Mark; Cargo, Margaret; Marks, Elisabeth; Paquet, Catherine; Simmons, David; Williams, Margaret; Rowley, Kevin; O'Dea, Kerin – Social Indicators Research, 2009
This study reports on the development and evaluation of a rating tool to assess the scientific utility and cultural appropriateness of community-level indicators for application with Indigenous populations. Indicator criteria proposed by the U.S. Institute of Medicine were culturally adapted through reviewing the literature and consultations with…
Descriptors: Research Design, Indigenous Populations, Public Health, Content Validity
Collishaw, Stephan; Goodman, Robert; Ford, Tamsin; Rabe-Hesketh, Sophia; Pickles, Andrew – Journal of Child Psychology and Psychiatry, 2009
Background: Assessments of child psychopathology commonly rely on multiple informants, e.g., parents, teachers and children. Informants often disagree about the presence or absence of symptoms, reflecting reporter bias, situation-specific behaviour, or random variation in measurement. However, few studies have systematically tested how far…
Descriptors: Psychopathology, Interrater Reliability, Children, Parents
Chitiyo, Morgan; Wheeler, John J. – Preventing School Failure, 2009
The reauthorization of the Individuals with Disabilities Education Act of 1997 emphasized the use of positive behavioral interventions, supports, and services for students with disabilities who display challenging behaviors. Unfortunately, most teachers and schools still lack systems for identification, adoption, and sustained use of these…
Descriptors: Behavior Modification, Disabilities, Technical Assistance, Consultation Programs
Tasse, Marc J.; And Others – 1994
The Quebec Adaptive Behavior Scale (QABS) is widely used in Quebec (Canada) to assess behavior of people with mental retardation in educational, vocational, residential or hospital settings. This study estimated the interrater agreement and test-retest reliability of the QABS. To determine test-retest reliability, the QABS was completed by 27…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Elementary Secondary Education, Foreign Countries
Ahadi, Stephan A.; And Others – 1990
The reliability and validity of teacher ratings, the relationship between teacher ratings and principal self-reports of instructional leadership, and the degree to which they are influenced by demographic factors are examined in this study. Methodology involved completion of the Instructional Leadership Inventory, a self-report measure, by 81…
Descriptors: Educational Environment, Elementary Secondary Education, Institutional Characteristics, Instructional Leadership
Aydin, Selami – Turkish Online Journal of Educational Technology - TOJET, 2006
This research aimed to investigate the effect of computers on the test and inter-rater reliability of writing test scores of ESL learners. Writing samples of 20 pen-paper and 20 computer group students were scored in analytic scoring method by two scorers, and then the scores were analyzed in Alpha (Cronbach) model. The results showed that the…
Descriptors: Foreign Countries, College Students, Computer Assisted Testing, English (Second Language)
Bunch, Michael B.; Littlefair, Wendy – 1988
A total of 2,000 essays written by 1,000 students was submitted to generalizability analyses for domain-referenced tests. Each student had written one essay on each of two prompts representing two models of discourse. Each essay was read by six readers and judged on a scale of from 1 to 4. No reader read essays from both prompts. Reader agreement…
Descriptors: Cutting Scores, Essay Tests, Generalizability Theory, Interrater Reliability
Primoff, Ernest S. – 1971
This report shows how Beta weights for the J-Coefficient may be easily developed without a formal validity study, and indicates how indications of ability other than tests can be used to measure the same abilities that are measured by tests. See also TM 001 163-64,166 for further information on job elements (J-Scale) procedures. (Author/DLG)
Descriptors: Achievement Rating, Correlation, Evaluation Criteria, Occupational Tests
Love, Judith A.; And Others – 1977
Perhaps more than ever before, college teaching is being studied and evaluated. This paper describes the development of a simple descriptive instrument used to focus observers' classifications and ratings of college teachers' instructional behaviors as recorded on video tape. The need for such an instrument is reviewed, the methodology for testing…
Descriptors: Classroom Observation Techniques, College Instruction, Correlation, Factor Analysis
Gilbert, Sharon L. – 1997
This study examined whether variations in the Developmental Observation Checklist (DC) format influences congruence of scores among both parents and the child's teacher. The DC was varied by adding pictorial illustrations and examples and having three response categories instead of two. Results from 100 sets of participants were evaluated with…
Descriptors: Check Lists, Developmental Delays, Early Intervention, Fathers

Direct link
Peer reviewed
