Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Taylor, Marcia B; Porterfield, William D. – 1984
This paper describes the Measure of Epistemological Reflection (MER), an instrument to assess cognitive developmental level according to the Perry scheme of intellectual and ethical development. It contains sets of questions for each of the six cognitive domains: decision making, learner role, instructor role in the learning process, peer role in…
Descriptors: Cognitive Development, Cognitive Tests, Epistemology, Higher Education
Tillinghast, B. S., Jr.; Renzulli, Joseph S. – Journal of Educational Research, 1968
The purpose of this study was to further examine the reliability of the Peabody Picture Vocabulary Test (PPVT), a new instrument to measure hearing vocabulary so that a student's verbal intelligence may be inferred. A group testing procedure was utilized by reproducing the PPVT plates on 35 millimeter transparent slides and projecting them onto a…
Descriptors: Aptitude Tests, Elementary School Students, Evaluation, Group Testing
Livingston, Samuel A. – 1970
The assumptions of the classical test-theory model are used to develop a theory of reliability for criterion-referenced measures which parallels that for norm-referenced measures. It is shown that the Spearman-Brown formula holds for criterion-referenced measures and that the criterion-referenced reliability coefficient can be used to correct…
Descriptors: Correlation, Criterion Referenced Tests, Measurement Instruments, Norm Referenced Tests
Baker, J. Philip – 1971
The usefulness of generalizability theory in assessing the reliability of classroom observation instruments is illustrated, with a new index of reliability, called the coefficient of generalizability, given as an index of how well one can generalize from the instrument to the universe score according to the conditions of observation. Data from an…
Descriptors: Analysis of Variance, Bias, Classroom Observation Techniques, Data Analysis
Peer reviewed: Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979
Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)
Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement
Peer reviewed: Magnusson, D.; Backteman, G. – Applied Psychological Measurement, 1979
A longitudinal study of approximately 1,000 students aged 10-16 showed high stability of intelligence and creativity. Stability coefficients for intelligence were higher than those for creativity. Results supported the construct validity of creativity. (MH)
Descriptors: Creativity, Creativity Tests, Elementary Secondary Education, Foreign Countries
Rojahn, Johannes; Tasse, Marc J.; Sturmey, Peter – American Journal on Mental Retardation, 1997
Development of the Stereotyped Behavior Scale for adolescents and adults with mental retardation is described. Use with 600 individuals resulted in refinement and a 26-item scale with an internal consistency alpha of 0.88, test-retest reliability of p=0.90, and interrater reliability of p=0.76. (DB)
Descriptors: Adolescents, Adults, Behavior Patterns, Behavior Rating Scales
Beuttler, Marybeth Grant; Leininger, Peter M.; Palisano, Robert J. – Physical & Occupational Therapy in Pediatrics, 2004
Purpose: The purpose of this study was to examine the test-retest and inter-rater reliability of a measure of muscle extensibility developed by Tardieu, de la Tour, Bret, and Tardieu (1982) in fullterm and preterm newborns. Method: Twenty-one fullterm infants and twenty preterm infants were examined by two physical therapists. Each physical…
Descriptors: Premature Infants, Neonates, Human Body, Motor Development
Hafner, John C.; Hafner, Patti M. – International Journal of Science Education, 2003
Although the rubric has emerged as one of the most popular assessment tools in progressive educational programs, there is an unfortunate dearth of information in the literature quantifying the actual effectiveness of the rubric as an assessment tool "in the hands of the students." This study focuses on the validity and reliability of the rubric as…
Descriptors: Interrater Reliability, Generalizability Theory, Biology, Scoring Rubrics
Ashvind Nand Singh – ProQuest LLC, 2008
Due to the relative inability of individuals with intellectual disabilities (ID) to provide an accurate and reliable self-report, assessment in this population is more difficult than with individuals in the general population. As such, assessment procedures must be adjusted to compensate for the relative lack of information that the individual can…
Descriptors: Test Items, Item Analysis, Test Construction, Behavior Rating Scales
MacQuarrie, David; Applegate, Brooks; Lacefield, Warren – Journal of Career and Technical Education, 2008
Career and Technical Education (CTE) is a nationwide program that emphasizes training for primary, secondary, and post secondary educational stages for the career and workforce needs of today and tomorrow's society. Mandated indicators of success have been set in place and secondary schools are expected to improve students' skill levels in…
Descriptors: Criterion Referenced Tests, Content Validity, Test Validity, Test Reliability
Hardre, Patricia L.; Davis, Kendrick A.; Sullivan, David W. – Educational Research and Evaluation, 2008
In the field of educational psychology, there is diverse and active research in motivation for learning and achievement. Many instruments exist for assessing students' motivation, primarily as self-report. Fewer instruments are available for assessing "teachers'" perceptions of their students' motivation, and fewer still for assessing teachers'…
Descriptors: Student Attitudes, Educational Psychology, Student Motivation, Secondary School Teachers
Mahar, Matthew T.; Rowe, David A. – Measurement in Physical Education and Exercise Science, 2008
Accurate measures of youth fitness are needed by researchers and practitioners. Evidence of validity and reliability are essential before results of youth fitness tests can be used to make sound decisions. This article describes a three-stage paradigm for validation research and provides guidance for conducting and understanding norm-referenced…
Descriptors: Test Reliability, Test Validity, Guidelines, Physical Education Teachers
Carlozzi, Noelle E.; Long, Patricia J. – Journal of Interpersonal Violence, 2008
Two studies examined the psychometric properties of the Posttraumatic Stress Disorder (PTSD) subscale of the SCL-90-R. Study 1 examined SCL-90-R responses from 2,361 college women to determine whether this subscale can appropriately assess the three dimensions of PTSD. Factor analysis and Cronbach's alpha suggest that this subscale is best…
Descriptors: Females, Posttraumatic Stress Disorder, Test Reliability, Test Validity
Hester, Eva Jackson – College Teaching, 2008
Student evaluations of advising (SEA) are often limited to student ratings of the faculty member's advising skills. As with teaching evaluations, ratings of SEA may not be the best reflection of the advisor's performance. In this study, the author analyzed SEA to determine the relationship between student characteristics and evaluation items.…
Descriptors: Student Attitudes, Student Characteristics, Item Analysis, Student Evaluation of Teacher Performance

