Publication Date
| In 2026 | 0 |
| Since 2025 | 354 |
| Since 2022 (last 5 years) | 1463 |
| Since 2017 (last 10 years) | 3331 |
| Since 2007 (last 20 years) | 5179 |
Descriptor
| Test Reliability | 9977 |
| Test Validity | 9977 |
| Test Construction | 3323 |
| Foreign Countries | 2919 |
| Psychometrics | 1818 |
| Factor Analysis | 1672 |
| Measures (Individuals) | 1329 |
| Evaluation Methods | 954 |
| Questionnaires | 931 |
| College Students | 868 |
| Factor Structure | 848 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 297 |
| Practitioners | 226 |
| Teachers | 84 |
| Administrators | 61 |
| Policymakers | 27 |
| Counselors | 25 |
| Students | 13 |
| Parents | 9 |
| Community | 5 |
| Support Staff | 5 |
Location
| Turkey | 688 |
| China | 175 |
| Australia | 171 |
| Canada | 146 |
| Indonesia | 120 |
| Spain | 106 |
| Taiwan | 91 |
| United States | 86 |
| Germany | 83 |
| United Kingdom | 82 |
| Malaysia | 77 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewedHernon, Peter; McClure, Charles R. – Library and Information Science Research, 1987
Discusses issues relating to the reliability, validity, utility, and information value of unobtrusive testing of library reference services; provides suggestions for practical applications of these criteria; applies study findings to library decision making and planning; and identifies topics for further methodological refinement. (Author/CLB)
Descriptors: Data Analysis, Data Collection, Data Interpretation, Experimenter Characteristics
Gearhart, Maryl; Novak, John R.; Herman, Joan L. – 1994
Technical questions regarding the reliability and validity of large-scale portfolio assessment were studied which focused on: (1) whether raters can score collections of writing reliably with rubrics designed for single samples; (2) whether ratings derived from different frameworks differ in their capacities to support technically sound…
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Essay Tests
Guerrero, Michael D. – 1994
A study evaluated the overall evaluative validity of the Four Skills Exam, a Spanish language proficiency test designed to ensure that bilingual education teachers in New Mexico can meet Spanish language demands in the bilingual education classroom. The test's construct validity was limited for several reasons. In designing a test capturing…
Descriptors: Bilingual Education, Comparative Analysis, Construct Validity, Elementary Secondary Education
Srebnik, Debra – 1996
This paper discusses the results of a study that investigated the validity and reliability of the Ecology Rating Scale (ERS). The ERS is a brief, multi-dimensional level-of-functioning instrument that can be rated by parents or clinicians. The ERS is comprised of seven domains of youth functioning: family, school, emotional, legal/justice,…
Descriptors: Academic Achievement, Adolescents, Behavior Disorders, Child Health
Yeung, Ka Wah; Watkins, David – 1998
Scales were developed to assess both the custodial and humanistic aspects of student control ideology. Research based on the responses of about 500 Hong Kong teacher education students showed responses to these scales were of adequate internal consistency, and confirmatory factor analysis supported two independent scales rather than a continuum…
Descriptors: Classroom Techniques, Discipline, Elementary Secondary Education, Foreign Countries
McCarney, Stephen B.; Anderson, Paul D. – 2000
This technical manual provides an overview of the "Transition Behavior Scale-Second Edition Self-Report Version" (TBS-2 S-RV), an assessment that provides the means by which student behavior, which is predictive of employment and societal transition behavior, can be documented. Those students (ages 12-18) most likely to have problems in employment…
Descriptors: Adolescents, Behavior Rating Scales, Disabilities, Education Work Relationship
Short, Francis X.; Winnick, Joseph P. – 1998
This monograph documents the basis for selection of test items and health-related, criterion-referenced standards associated with the Brockport Physical Fitness Test (BPFT), a criterion-referenced fitness test for children and adolescents with disabilities. The manual is divided into separate chapters for the relevant components or sub-components…
Descriptors: Adolescents, Aerobics, Body Composition, Child Health
Peer reviewedMehrens, William A.; Popham, W. James – Applied Measurement in Education, 1992
This paper discusses how to determine whether a test was developed in a legally defensible manner, reviewing general issues, specific cases bearing on different types of test use, some evaluative dimensions, and evidence of test quality. Tests constructed and used according to existing standards will generally stand legal scrutiny. (SLD)
Descriptors: College Entrance Examinations, Compliance (Legal), Constitutional Law, Court Litigation
Peer reviewedHaladyna, Thomas M.; And Others – Educational Researcher, 1991
Because of the importance of standardized test scores in current definitions of educational achievement, pressure to raise test scores has affected their accuracy. Examines the causes of two major sources of test score pollution and their impact on education. Discusses the ethical status of documented test-preparation activities. (CJS)
Descriptors: Academic Achievement, Achievement Tests, Classroom Techniques, Criterion Referenced Tests
Peer reviewedMaxwell, Kelly L.; McWilliam, R. A.; Hemmeter, Mary Louise; Ault, Melinda Jones; Schuster, John W. – Early Childhood Research Quarterly, 2001
Tested psychometric properties of a new measure of developmentally appropriate practices in kindergarten through third grade and the predictive value of classroom and teacher characteristics. Found that grade, class size, number of children with disabilities, teacher educational level, teacher experience, and teacher beliefs accounted for 42…
Descriptors: Class Size, Classroom Environment, Developmentally Appropriate Practices, Elementary School Teachers
Brown, Gavin T. L.; Glasswell, Kath; Harland, Don – Assessing Writing, 2004
Accuracy in the scoring of writing is critical if standardized tasks are to be used in a national assessment scheme. Three approaches to establishing accuracy (i.e., consensus, consistency, and measurement) exist and commonly large-scale assessment programs of primary school writing demonstrate adjacent agreement consensus rates of between 80% and…
Descriptors: Writing Evaluation, Student Evaluation, Educational Assessment, Writing Tests
Bayer, Jordana K.; Sanson, Ann V.; Hemphill, Sheryl A. – Journal of Emotional and Behavioral Disorders, 2006
Internalizing disorders are a public health issue affecting up to 20% of school-age children, yet few receive assistance. Internalizing difficulties can emerge in the preschool years, with stability from this time onward. To inform prevention programs, knowledge is needed about early internalizing indicators in community samples. This study…
Descriptors: Questionnaires, Prevention, Young Children, Public Health
PDF pending restorationRaju, Nambury S.; And Others – 1992
In March 1992 the Arizona State Department of Education and educators across the state conducted a pilot study of 67 performance assessments developed for the Arizona Student Assessment Program (ASAP). This report describes various aspects of the reliability and validity of the 67 assessments (primarily constructed response) developed by The…
Descriptors: Academic Achievement, Educational Assessment, Elementary Secondary Education, Grade 12
Wangerin, Paul T. – 1994
This paper addresses problems confronting law school teachers in grading law school exams and assigning letter grades. Using prototypical dialogue and scenarios, the paper examines mathematical and statistical issues that contribute to grading errors. Discussed in relation to real world data and the bar exam are: differential weighting, combining…
Descriptors: Civil Rights, Court Litigation, Educational Malpractice, Error of Measurement
Vansickle, Timothy R. – 1992
The scaling of a new assessment is a significant undertaking. The scaling of a new assessment designed as a multiple-level, criterion-referenced assessment is even more so. A Guttman approach to scaling was used with the Work Keys selected-response assessments, Reading for Information and Applied Mathematics. Assessments in development in the Work…
Descriptors: Criterion Referenced Tests, Employment Qualifications, High School Students, High Schools

Direct link
