Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 117 |
| Since 2017 (last 10 years) | 228 |
| Since 2007 (last 20 years) | 561 |
Descriptor
| Evaluation Methods | 1408 |
| Test Reliability | 1408 |
| Test Validity | 954 |
| Student Evaluation | 339 |
| Test Construction | 305 |
| Foreign Countries | 217 |
| Higher Education | 183 |
| Measurement Techniques | 170 |
| Psychometrics | 168 |
| Elementary Secondary Education | 147 |
| Evaluation Criteria | 122 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 74 |
| Practitioners | 72 |
| Teachers | 29 |
| Administrators | 18 |
| Policymakers | 11 |
| Students | 4 |
| Counselors | 3 |
| Support Staff | 3 |
| Community | 1 |
| Parents | 1 |
Location
| Australia | 24 |
| United Kingdom | 22 |
| Canada | 18 |
| Turkey | 16 |
| China | 14 |
| United States | 14 |
| California | 11 |
| Netherlands | 10 |
| Florida | 9 |
| Texas | 8 |
| United Kingdom (England) | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
PDF pending restorationKriewall, Thomas E. – 1972
The measurement information generated by CRT's is designed for use in instructional management systems where classifications of pupils for treatment are to be decided on the basis of minimal data consistent with predetermined limits for the errors of misclassification. The measures obtained are content specific estimates of proficiency useful for…
Descriptors: Ability Grouping, Academic Achievement, Criterion Referenced Tests, Decision Making
Peer reviewedRichards, P. Scott; And Others – Computers in the Schools, 1986
Describes the development and preliminary validation of the Computer Attitudes Scale (CAS), which is designed to provide researchers and educators with a way of assessing some basic student attitudes about computer usage. (MBR)
Descriptors: Analysis of Variance, Attitude Measures, Elementary Secondary Education, Evaluation Methods
Peer reviewedRubin, Rebecca B. – Western Journal of Speech Communication, 1986
Argues in defense of Rubin's CCAI, that Powell and Avila's study did not use adequate controls to assess the instrument's reliability. Asserts that their study simply discovered differences in communication competence among ethnic groups. (MS)
Descriptors: Academic Achievement, Blacks, Communication Research, Comparative Analysis
Peer reviewedCason, Gerald J.; Cason, Carolyn L. – Evaluation and the Health Professions, 1984
The proposed theory provides a basis for both measuring and correcting rater stringency error in some grossly incomplete rating data matrices. The theoretical model fits ratings made by faculty and resident physicians of student clinical performance in each of three junior year medical student cohorts better than alternative models. (Author)
Descriptors: Clinical Teaching (Health Professions), Evaluation Criteria, Evaluation Methods, Higher Education
Peer reviewedPalumbo, David B.; Reed, W. Michael – Journal of Research on Computing in Education, 1989
Discussion of the use of microcomputers in education focuses on the use of microcomputers for evaluation purposes. A study that collected performance data on both college students and the evaluation instrument is described, reliability and validity of the test are discussed, and the use of microcomputers to redesign instructional systems is…
Descriptors: Academic Achievement, Computer Assisted Instruction, Computer Assisted Testing, Computer Managed Instruction
Peer reviewedWattenbarger, James L.; McLeod, Norman – Community College Review, 1989
Discusses a study of the effectiveness of employing multiple measures and sources of data to place students in appropriate mathematics courses. Considers the use of achievement test scores, high school mathematics grades, clinical evaluation techniques, and content-oriented examinations in assessment. (DMM)
Descriptors: Achievement Tests, Community Colleges, Evaluation Methods, Evaluation Problems
Peer reviewedMeisels, Samuel J.; And Others – Early Childhood Research Quarterly, 1995
Examined the reliability and validity of the Work Sampling System (WSS) for evaluating the schoolwork of 100 kindergarten children. Results indicated that the WSS checklist and summary report had very high internal and moderately high interrater reliability. The WSS accurately predicted the performance of the children on a norm-referenced…
Descriptors: Academic Achievement, Achievement Tests, Check Lists, Early Childhood Education
Peer reviewedParker, Richard I.; And Others – Exceptional Children, 1991
This study investigated the technical adequacy of 7 objective indexes of writing quality in monitoring the progress of 36 middle school students with mild disabilities over a 6-month period. Three indexes (such as "percent of legible words") were moderately correlated with holistic ratings but were not sufficiently stable over time. (Author/JDD)
Descriptors: Evaluation Methods, Holistic Approach, Intermediate Grades, Junior High Schools
Peer reviewedHerrington, Jan; Herrington, Anthony – Higher Education Research and Development, 1998
Describes seven defining characteristics of "authentic" assessment and how they have been operationalized in an interactive multimedia learning environment in preservice teacher education. A study showed students responded favorably to the elements of authentic assessment and had a good understanding of the software's content; this was…
Descriptors: College Instruction, College Students, Computer Assisted Testing, Evaluation Methods
Peer reviewedMertler, Craig A. – Mid-Western Educational Researcher, 2000
A study of methods used to ensure validity and reliability in classroom assessments surveyed 625 elementary and secondary teachers in Ohio. Results indicate teachers spent little time conducting statistical analyses of their student evaluation data, and many techniques used were poor and inadequate. Additional professional development and improved…
Descriptors: Educational Needs, Educational Objectives, Educational Practices, Elementary Secondary Education
Peer reviewedMelnick, Susan L.; Pullin, Diana – Journal of Teacher Education, 2000
Examines recent implementation of the controversial Massachusetts Educator Certification Tests and the educational, legal, and public policy issues in the implementation of a teacher testing program. The paper focuses on: the contexts for teacher testing; the tests themselves; the quality of the tests; and legal issues in teacher testing and…
Descriptors: Accountability, Educational Policy, Elementary Secondary Education, Evaluation Methods
Peer reviewedAnderson, Diane; Reilly, Judy – Journal of Deaf Studies and Deaf Education, 2002
This article discusses the development of the MacArthur Communicative Development Inventory for American Sign Language (ASL-CDI), a parent report that measures early sign production. Normative data from 69 children (8-36 months) with deafness and their parents with deafness found the development of the ASL-CDI has been successful. (Contains…
Descriptors: American Sign Language, Deafness, Evaluation Methods, Infants
Peer reviewedSarouphim, Ketty M. – Exceptional Children, 1999
A comparison between performance-based DISCOVER (Discovering Intellectual Strengths and Capabilities through Observation while allowing for Varied Ethnic Responses) assessment reports and two independent ratings in appraising 24 kindergartners' multiple intelligences through specific activities found that when intelligences were assessed through…
Descriptors: Ability Identification, Activities, Classroom Observation Techniques, Cultural Background
Comfort, Marilee; Gordon, Philip R.; Unger, Donald G. – Zero to Three, 2006
The Keys to Interactive Parenting Scale (KIPS) is a brief practical tool to assess the quality of parenting interactions across 12 parenting behaviors. Family service providers from a variety of education, health, and social service settings can use KIPS to identify parenting strengths and needs. In this article, the authors describe the rating…
Descriptors: Disadvantaged Youth, Family Programs, Parents as Teachers, Child Rearing
August, Diane; Francis, David J.; Hsu, Han-Ya Annie; Snow, Catherine E. – Elementary School Journal, 2006
A new measure of reading comprehension, the Diagnostic Assessment of Reading Comprehension (DARC), designed to reflect central comprehension processes while minimizing decoding and language demands, was pilot tested. We conducted three pilot studies to assess the DARC's feasibility, reliability, comparability across Spanish and English,…
Descriptors: Reading Comprehension, Bilingual Students, Evaluation Methods, Spanish Speaking

Direct link
