Publication Date
| In 2026 | 1 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 214 |
| Since 2017 (last 10 years) | 495 |
| Since 2007 (last 20 years) | 987 |
Descriptor
| Test Validity | 3911 |
| Test Reliability | 1519 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 618 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 495 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Stricker, Lawrence J; And Others – 1972
This study's aim was to assess the validity of naive subjects' implicit personality theories, the correspondence among the theories, and the influence of social desirability on them. High school girls classified the items from the MMPI Psychopathic Deviate scale into clusters representing different traits. These clusters agreed closely with the…
Descriptors: Factor Analysis, Females, High School Seniors, High School Students
Gearhart, Maryl; Novak, John R.; Herman, Joan L. – 1994
Technical questions regarding the reliability and validity of large-scale portfolio assessment were studied which focused on: (1) whether raters can score collections of writing reliably with rubrics designed for single samples; (2) whether ratings derived from different frameworks differ in their capacities to support technically sound…
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Essay Tests
Short, Francis X.; Winnick, Joseph P. – 1998
This monograph documents the basis for selection of test items and health-related, criterion-referenced standards associated with the Brockport Physical Fitness Test (BPFT), a criterion-referenced fitness test for children and adolescents with disabilities. The manual is divided into separate chapters for the relevant components or sub-components…
Descriptors: Adolescents, Aerobics, Body Composition, Child Health
Peer reviewedLeitzel, Thomas C.; Vogler, Daniel E. – Journal of Applied Research in the Community College, 1995
Reviews study that found, through use of a "performance instruction" model, a significant misalignment between the cognitive complexity levels of planned course content and that which is tested. (32 citations) (YKH)
Descriptors: Cognitive Processes, Community Colleges, Correlation, Course Content
Brown, Gavin T. L.; Glasswell, Kath; Harland, Don – Assessing Writing, 2004
Accuracy in the scoring of writing is critical if standardized tasks are to be used in a national assessment scheme. Three approaches to establishing accuracy (i.e., consensus, consistency, and measurement) exist and commonly large-scale assessment programs of primary school writing demonstrate adjacent agreement consensus rates of between 80% and…
Descriptors: Writing Evaluation, Student Evaluation, Educational Assessment, Writing Tests
PDF pending restorationRaju, Nambury S.; And Others – 1992
In March 1992 the Arizona State Department of Education and educators across the state conducted a pilot study of 67 performance assessments developed for the Arizona Student Assessment Program (ASAP). This report describes various aspects of the reliability and validity of the 67 assessments (primarily constructed response) developed by The…
Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Grade 12
Wangerin, Paul T. – 1994
This paper addresses problems confronting law school teachers in grading law school exams and assigning letter grades. Using prototypical dialogue and scenarios, the paper examines mathematical and statistical issues that contribute to grading errors. Discussed in relation to real world data and the bar exam are: differential weighting, combining…
Descriptors: Civil Rights, Court Litigation, Educational Malpractice, Error of Measurement
Carifio, James – 1992
Researchers and program evaluators would often like to use a particular instrument, but do not because it is too long or would require too much testing time. Having a validated set of objective procedures for reducing the size of an instrument could improve many research and evaluation efforts. This paper reports the results of test reduction or…
Descriptors: Attitude Measures, Elementary School Students, Factor Analysis, Intermediate Grades
Radnor, Hilary – 1993
The Moderation and Assessment Project, South West, was an outgrowth of the Technical and Vocational Educational Initiative of the government of the United Kingdom that attempted to develop more courses with vocational relevance for adolescents. Growing from research projects under the Moderation and Assessment project, a new model of moderation is…
Descriptors: Curriculum Development, Education Work Relationship, Educational Assessment, Evaluation Methods
California State Postsecondary Education Commission, Sacramento. – 1990
This report of the California State Postsecondary Education Commission discusses standardized testing at the higher education level in California and is comprised of three parts. Part 1 describes six tests related to undergraduate admission, placement, and financial assistance. Part 2 describes five tests required by graduate programs and…
Descriptors: Access to Education, College Admission, Comparative Analysis, Educational Testing
Martin, Nancy K.; Baldwin, Beatrice – 1993
A preliminary investigation was conducted of the construct validity of the Inventory of Classroom Management Style (ICMS), a scale to measure differences in perceptions of classroom management style. The main objective was to determine if the scale reflects differences between novice and experienced teachers. Classroom management is defined as a…
Descriptors: Beliefs, Classroom Techniques, College Students, Comparative Testing
Busch, John Christian; Jaeger, Richard M. – 1989
The role of expert judges in establishing the content validity of the National Teacher Examinations (NTE) was examined in a detailed study. The NTE are used in 22 states for screening applicants to teacher education programs and/or for screening candidates for initial teacher certification. Such extensive use of the NTE has stimulated 35 recent…
Descriptors: College Entrance Examinations, Content Validity, Data Collection, Evaluators
Ernest, Patricia S.; And Others – 1986
These papers were presented as a symposium on the revision of the general education program at the University of Montevallo (UM) in Alabama. The first paper, "Introduction to Evaluation of General Education at the University of Montevallo," by Patricia S. Ernest and Elizabeth H. Rodgers, describes the core curriculum study and the 13…
Descriptors: College Curriculum, College Faculty, Core Curriculum, Curriculum Evaluation
Alaska State Dept. of Education, Juneau. Office of Evaluation, Assessment and Research. – 1985
Alaskan students' scores on the Scholastic Aptitude Test (SAT) increased nine points between 1984 and 1985, matching the national gain. These scores marked the fourth year of increases following 17 years of consistently declining scores. Thirty-three percent of Alaska's high school seniors took the SAT in 1985. The combined score of 923 was 17…
Descriptors: Academic Aptitude, College Entrance Examinations, Educational Trends, High Schools
Kohr, Richard L., Comp.; And Others – 1983
This guide begins with a series of questions and answers that introduce Pennsylvania's Educational Quality Assessment (EQA) Inventory as a 188- to 190-item multiple-choice test for grades 5, 8, and 11. Items are selected from a 400-item bank using matrix sampling procedures. Test results are analyzed at the school level; no individual student…
Descriptors: Achievement Tests, Affective Measures, Affective Objectives, Basic Skills

Direct link
