Lee, Sang Min; Puig, Ana; Pasquarella-Daley, Lauren; Denny, George; Rai, Ann Allen; Dallape, Aprille; Parker, Woodrow Max – Measurement and Evaluation in Counseling and Development, 2007
This article describes the revision of the White Racial Consciousness Development Scale (D. Claney & W. M. Parker, 1989). A multistage approach including item generation, item refinement and selection, and evaluation of score validity and reliability was used for test construction and validation. Implications for theory, practice, and future…
Descriptors: Measures (Individuals), Test Construction, Test Items, Scores
May, Henry; Perez-Johnson, Irma; Haimson, Joshua; Sattar, Samina; Gleason, Phil – National Center for Education Evaluation and Regional Assistance, 2009
Securing data on students' academic achievement is typically one of the most important and costly aspects of conducting education experiments. As state assessment programs have become practically universal and more uniform in terms of grades and subjects tested, the relative appeal of using state tests as a source of study outcome measures has…
Descriptors: Testing Programs, Academic Achievement, Researchers, Educational Research
Peer reviewed: Calhoun, Angela; And Others – Volta Review, 1988
Twenty normal-hearing, sighted subjects (ages 20-42) viewed soundless videotapes of a speaker reading lists from the two forms of the Utley Lipreading Test and three from Harris' revised Central Institute for the Deaf (CID) Everyday Sentences. Results do not support the interchange of Utley and CID sentences for test-retest comparisons of…
Descriptors: Hearing Impairments, Lipreading, Perception Tests, Test Reliability
Carey, John C.; And Others – Journal of College Student Personnel, 1986
Describes the development of a highly reliable 28-item scale to measure rapport between college roommates. This instrument should be useful in both research and practice. (Author/BL)
Descriptors: College Students, Higher Education, Measures (Individuals), Rapport
Peer reviewed: Knox, Marie – Australia and New Zealand Journal of Developmental Disabilities, 1985
The paper describes the Children's Interaction Schedule (CIS), an observation schedule designed to measure children's skills or processes in interacting with peers. There are nine categories of interactive behavior on which the CIS is based. The effectiveness of the instrument in a field situation, using an interval-recording technique of behavior…
Descriptors: Children, Disabilities, Interaction, Peer Relationship
Peer reviewed: Cicchetti, Domenic V.; And Others – Educational and Psychological Measurement, 1984
This program computes multiple judge reliability levels under the following conditions: (1) different sets of judges perform the ratings; (2) the number of judges is a constant; and (3) the scale of measurement is nominal. (Author)
Descriptors: Computer Software, Interrater Reliability, Judgment Analysis Technique, Test Reliability
Cronbach, Lee J. – Center for Research on Evaluation Standards and Student Testing CRESST, 2004
Where the accuracy of a measurement is important, whether for scientific or practical purposes, the investigator should evaluate how much random error affects the measurement. New research may not be necessary when a procedure has been studied enough to establish how much error it involves. But, with new measures, or measures being transferred…
Descriptors: Error of Measurement, Test Reliability, Generalizability Theory, Educational Research
Coscarelli, William; Shrock, Sharon – Performance Improvement Quarterly, 2002
Discusses problems in using traditional measures of reliability for criterion-referenced tests (CRTs) and describes two approaches to reliability for CRTs: estimates sensitive to all measures of error; and estimates of consistency in test outcome. Compares the two approaches and proposes recommendations for interpretation and use. (Author/LRW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Measurement Techniques, Test Reliability
Westhoff, Gerard J. – Language Teaching, 2009
Teachers' competence to estimate the effectiveness of learning materials is important and often neglected in programmes for teacher education. In this lecture I will try to explore the possibilities of designing scaffolding instruments for "a priori" assessment of language learning tasks, based on insights from SLA and cognitive psychology, more…
Descriptors: Cognitive Psychology, Instructional Materials, Instructional Effectiveness, Test Reliability
Tracey, Terence J. G.; Sodano, Sandro M. – Career Development Quarterly, 2008
Interest development is not an easily studied process. There are at least 4 methods for examining the process of stability and change over time: relative stability, absolute stability, profile stability, and structural stability. A program of research that focuses on examining these 4 types of stability is summarized relative to the issues…
Descriptors: Vocational Interests, Childhood Interests, Attitude Change, Research Projects
Satter, Ellyn – Journal of Nutrition Education and Behavior, 2007
The Satter Eating Competence Model (ecSatter) conceptualizes eating competence as having 4 components: eating attitudes, food acceptance, regulation of food intake and body weight, and management of the eating context (including family meals). According to ecSatter, supporting nutritional health requires establishing and maintaining positive…
Descriptors: Body Weight, Nutrition, Eating Habits, Food
Peer reviewed: Wood, Terry M.; Safrit, Margaret J. – Research Quarterly for Exercise and Sport, 1984
A proposed model for estimating psychomotor test battery reliability, based upon canonical correlation analysis, is described. (Author/JMK)
Descriptors: Evaluation Criteria, Multivariate Analysis, Physical Education, Psychomotor Skills
Raykov, Tenko; du Toit, Stephen H. C. – Structural Equation Modeling: A Multidisciplinary Journal, 2005
A method for estimation of reliability for multiple-component measuring instruments with clustered data is outlined. The approach is applicable with hierarchical designs where individuals are nested within higher order units and exhibit possibly related performance on components of a scale of interest. The procedure is developed within the…
Descriptors: Structural Equation Models, Computation, Measurement Techniques, Test Reliability
Moss, Pamela A. – Journal of Educational and Behavioral Statistics, 2004
The concern behind my question, "Can there be validity without reliability?" (Moss, 1994), was about the influence of measurement practices on the quality of education. I argued that conventional operationalizations of reliability in the measurement literature, which I summarized as "consistency, quantitatively defined, among independent…
Descriptors: Psychometrics, Measurement Techniques, Test Validity, Test Reliability
Porter, Andrew; Goldring, Ellen; Elliott, Stephen; Murphy, Joseph; Polikoff, Morgan; Cravens, Xiu – Online Submission, 2008
The Vanderbilt Assessment of Leadership in Education is a 360-degree assessment of principals' learning-centered leadership behaviors. The instrument was designed to provide formative and summative assessment to principals on the leadership behaviors most important to student learning. The purpose of this report is to describe the standard-setting…
Descriptors: Elementary Secondary Education, Standard Setting, Leadership Effectiveness, Cutting Scores

