Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Hafner, John C.; Hafner, Patti M. – International Journal of Science Education, 2003
Although the rubric has emerged as one of the most popular assessment tools in progressive educational programs, there is an unfortunate dearth of information in the literature quantifying the actual effectiveness of the rubric as an assessment tool "in the hands of the students." This study focuses on the validity and reliability of the rubric as…
Descriptors: Interrater Reliability, Generalizability Theory, Biology, Scoring Rubrics
Protopapas, Athanassios; Skaloumbakas, Christos – Journal of Learning Disabilities, 2007
In this study, we examined the characteristics of reading disability (RD) in the seventh grade of the Greek educational system and the corresponding diagnostic practice. We presented a clinically administered assessment battery, composed of typically employed tasks, and a fully automated, computer-based assessment battery that evaluates some of…
Descriptors: Foreign Countries, Computer Assisted Testing, Grade 7, Reading Difficulties
Boyle, Michael H.; Cunningham, Charles E.; Georgiades, Katholiki; Cullen, John; Racine, Yvonne; Pettingill, Peter – Journal of Child Psychology and Psychiatry, 2009
Background: This study examines the use of the Brief Child and Family Phone Interview (BCFPI) to screen for childhood psychiatric disorder based on Diagnostic Interview Schedule for Children Version IV (DISC-IV) classifications of attention-deficit hyperactivity disorder (ADHD), oppositional defiant disorder (ODD), conduct disorder (CD),…
Descriptors: Health Services, Mental Health Programs, Mental Health, Child Health
Fago, George C. – 1995
This study examined the validity of an instrument designed to measure the effectiveness of faculty advisers to freshmen at a small, private, liberal arts college (approximately 1,100 students). The 19-question Advising Effectiveness Questionnaire (AEQ) was distributed to three successive freshman classes at the end of each academic year. Factor…
Descriptors: Academic Advising, College Students, Evaluation Methods, Faculty Advisers
Feldt, Leonard S. – 1983
This paper considers, from a theoretical point of view, two measurement approaches used in measuring success and failure in skills tests in physical education. The first, "fixed length" (FL) testing, entails counting the number of successful performances in a fixed number of trials. The second, "trials-to-criterion" (TTC)…
Descriptors: Evaluation Methods, Mathematical Formulas, Mathematical Models, Measurement Techniques
Brown, William R. – 1988
The evaluation tools written by teachers are rarely valid or reliable. One teaching aid that can help in the creation of an effective evaluation instrument is called a test map. A test map is a systematic method to consider variables that are important in the construction of the format of a test. Five variables that are discussed in the test…
Descriptors: Elementary Secondary Education, Evaluation Methods, Higher Education, Student Evaluation
Carr, Marion – 1983
The faculty of an intensive program of English as a second language for college-bound students, questioning the objectivity of faculty evaluations of non-native college applicants' written essays, assessed the existing evaluation process, reformed it, tested it, and planned for ongoing development. In the first stage, readers read and graded…
Descriptors: Admission Criteria, College Applicants, English (Second Language), Essays
West, Ellen M. – 1977
This study tested the validity and reliability of the West Informal Reading Evaluation (WIRE), an unobtrusive screening device that identifies adult learners' reading abilities. Of the 154 students in basic education or high school completion courses who completed one form of WIRE, 123 subjects were administered another reading test that served as…
Descriptors: Adult Basic Education, Adults, Evaluation Methods, Informal Reading Inventories
Stiggins, Richard J. – 1981
An area of current concern is that of the advantages and disadvantages of measuring writing proficiency directly via writing samples, and indirectly via objective tests. Much research has been completed documenting the correlation between direct and indirect measures. However, there had not yet been a systematic and detailed conceptual analysis…
Descriptors: Comparative Analysis, Elementary Secondary Education, Evaluation Methods, Higher Education
Baker, Robert, F.; And Others – 1980
A set of procedures were developed for evaluating the State Capacity Building Programs; (SCBP), state projects for increasing facilities for the dissemination of information related to education. Six scales were developed, based on questionnaire items, to evaluate the following six facets of state information-dissemination systems: comprehensive…
Descriptors: Evaluation Criteria, Evaluation Methods, Information Dissemination, Latent Trait Theory
PDF pending restorationEstes, Carole; Estes, Gary D. – 1980
Multiple matrix sampling is a sampling design in which both test items and examinees are randomly sampled from their respective populations. This study was designed to develop and assess a method for computing an estimate of a correlation coefficient when a multiple matrix sampling design is used. The examinee populations included 212 third-grade…
Descriptors: Correlation, Elementary Secondary Education, Evaluation Methods, Grade 3
Mauch, James E.; And Others – 1970
Fundamental problems in the collection and use of data, in particular the need to develop and use criteria for data, are discussed. Specific examples of the use and misuse of data related to teachers and children in the public schools are presented. Two methods of data collection--manifest techniques and formal response instruments--are evaluated…
Descriptors: Data Analysis, Data Collection, Educational Research, Evaluation Methods
Schools Council, London (England). – 1966
Thirty-two secondary schools participated in the testing of: (1) musical literacy (reading and listening); (2) general musical knowledge; and (3) corporate music making. The total number of candidates were 410 pupils of whom 260 were girls, and 150 were boys. The first two parts of the test were based on excerpts from recorded music, presented in…
Descriptors: Audiotape Recordings, Bulletins, Evaluation Methods, Listening Skills
Welsh, James – 1971
The Pennsylvania Plan for Educational Quality Assessment is discussed as to its beginnings, its goals, its implementation, and its future. The plan aids schoolmen of the State in assessing the quality of education their schools are providing in relation to 10 goals, which propose to encourage the students to develop self-understanding,…
Descriptors: Educational Objectives, Educational Quality, Evaluation Criteria, Evaluation Methods
Peer reviewedReynolds, William M.; Baker, Jean A. – American Journal of Mental Retardation, 1988
The Self-Report Depression Questionnaire (SRDQ), a measure of depressive symptomatology in persons with mental retardation, was administered to 89 mentally retarded adults living in community-based settings. The SRDQ demonstrated high internal consistency reliability, as well as moderate stability over an 11-week period. Content validity and…
Descriptors: Adults, Community Programs, Depression (Psychology), Evaluation Methods

Direct link
