Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Peer reviewedJoe, George W.; Woodward, J. Arthur – Psychometrika, 1976
This article is concerned with estimation of components of maximum generalizability in multifacet experimental designs involving multiple dependent measures. An example of a two-facet partially nested design is provided. (Author/RC)
Descriptors: Analysis of Variance, Correlation, Matrices, Reliability
Hendel, Darwin D.; Weiss, David J. – Educ Psychol Meas, 1970
It would appear that traditional modelsof reliability, in which reliability estimates for an individual are estimated from group data, could yiel more accurate estimates if individual difference variables, such as response consistency, were taken into consideration in the estimation of reliability. (DG)
Descriptors: Individual Differences, Measurement, Measurement Techniques, Rating Scales
Peer reviewedLepkin, Sheila Ratsch; Pryzwansky, Walter B. – Psychology in the Schools, 1983
Investigated the interrater reliability of teachers' and school psychology externs' scoring of protocols for the Developmental Test of Visual-Motor Integration (VMI), using a revised scoring system. Results showed high reliability coefficients for all raters, regardless of the scoring system employed. The influence of rater training is discussed.…
Descriptors: Interrater Reliability, Outcomes of Education, Preschool Teachers, Primary Education
Peer reviewedLantz, Annika; Friedrich, Peter – Learning Organization, 2003
A competence assessment instrument that measures cognitive complexity used structured interviews to investigate means-goal relationships in different work activities. Validity and reliability were confirmed by two tests of interrater reliability and six tests of validity (content, face, and criterion). (Contains 27 references.) (SK)
Descriptors: Competence, Interrater Reliability, Interviews, Lifelong Learning
Peer reviewedPolatajko, Helene; And Others – Canadian Journal of Occupational Therapy, 1993
Two occupational therapists rated 13 students after 1-week placements, using the Performance Evaluation of Occupational Therapy Students (PEOTS). The instrument had good interrater reliability but test-retest reliability was difficult to evaluate. Preliminary findings support the use of PEOTS as an evaluation tool. (JOW)
Descriptors: Clinical Experience, Interrater Reliability, Occupational Therapy, Student Evaluation
Peer reviewedBaume, David; Yorke, Mantz – Studies in Higher Education, 2002
Analyzed the assessments of 53 portfolios used to evaluate participants in a development course for higher education teachers at the United Kingdom's Open University. Findings included a high reliability in assessment at the level of course outcomes, and that cumulation of component assessments is very likely to reduce the reliability of overall…
Descriptors: Foreign Countries, Higher Education, Interrater Reliability, Portfolio Assessment
Peer reviewedGaudet, Laura; Pulos, Steve; Crethar, Hugh; Burger, Susan – Education and Training in Mental Retardation and Developmental Disabilities, 2002
In this study, self-reports of 34 individuals with developmental disabilities (DD) were compared with proxy ratings from family and providers. Correlations between the ratings of individuals with DD and the proxy raters were low, as were the correlations between family members and providers. In all scales except "cognition," the individual with DD…
Descriptors: Adults, Developmental Disabilities, Evaluation Methods, Interrater Reliability
Maydeu-Olivares, Alberto; Coffman, Donna L.; Hartmann, Wolfgang M. – Psychological Methods, 2007
The point estimate of sample coefficient alpha may provide a misleading impression of the reliability of the test score. Because sample coefficient alpha is consistently biased downward, it is more likely to yield a misleading impression of poor reliability. The magnitude of the bias is greatest precisely when the variability of sample alpha is…
Descriptors: Intervals, Scores, Sample Size, Simulation
Rae, Gordon – Psychological Methods, 2007
The relationship between stratified alpha (alpha-sub(s)) and the reliability of a test composed of interrelated nonhomogeneous items is examined. It is mathematically demonstrated that when there is congeneric equivalence within the strata or subtests, the difference between the coefficients is a function of the variances of the loadings within…
Descriptors: Test Reliability, Test Items, Computation, Error of Measurement
Miller, Dianna Bailey – ProQuest LLC, 2009
The purpose of this quantitative correlational research study was to examine the relationship between leadership styles of community college nurse educators in Texas and licensure passage rates of nursing community college graduates in Texas. Surveys were conducted to obtain the nurse educators' demographic data. The Multifactor Leadership…
Descriptors: Statistical Analysis, Correlation, Community Colleges, Nursing Education
Reynolds, Meree; Wheldall, Kevin; Madelaine, Alison – Australian Journal of Learning Difficulties, 2009
Early years teachers are in need of efficient measures to identify young students who are not making adequate progress in learning to read. The Wheldall Assessment of Reading Lists (WARL) has been developed to meet this need. The test, a curriculum-based measure of word identification fluency, consists of a series of parallel lists of frequently…
Descriptors: Curriculum Based Assessment, Word Lists, Test Construction, Reading Difficulties
Mazaheri, Mehrdad; Theuns, Peter – Social Indicators Research, 2009
The current study evaluates three hypothesized models on subjective well-being, comprising life domain ratings (LDR), overall satisfaction with life (OSWL), and overall dissatisfaction with life (ODWL), using structural equation modeling (SEM). A sample of 1,310 volunteering students, randomly assigned to six conditions, rated their overall life…
Descriptors: Life Satisfaction, Structural Equation Models, Well Being, Predictive Validity
Chafouleas, Sandra M.; Briesch, Amy M.; Riley-Tillman, T. Chris; McCoach, D. Betsy – School Psychology Quarterly, 2009
The purpose of this study was to develop and provide an initial examination of a self-report measure of intervention usage called the Usage Rating Profile-Intervention (URP-I). From an initial pool of 55 items, results of exploratory factor analysis and reliability estimates supported a measure containing 35 items and four factors as relevant…
Descriptors: Intervention, Factor Structure, Factor Analysis, Self Evaluation (Individuals)
Martinez, Rebecca S.; Missall, Kristen N.; Graney, Suzanne Bamonto; Aricak, O. Tolga; Clarke, Ben – Assessment for Effective Intervention, 2009
The current study examines the technical adequacy of four Early Numeracy Curriculum-Based Measurement (EN-CBM) screening tasks: "Oral Counting" (OC), "Number Identification" (NI), "Quantity Discrimination" (QD), and "Missing Number" (MN). Results from 59 kindergarten students assessed in the fall and spring reveal moderate to high test-retest and…
Descriptors: Curriculum Based Assessment, Numeracy, Predictive Validity, Kindergarten
Wuttiprom, Sura; Sharma, Manjula Devi; Johnston, Ian D.; Chitaree, Ratchapak; Soankwan, Chernchok – International Journal of Science Education, 2009
Conceptual surveys have become increasingly popular at many levels to probe various aspects of science education research such as measuring student understanding of basic concepts and assessing the effectiveness of pedagogical material. The aim of this study was to construct a valid and reliable multiple-choice conceptual survey to investigate…
Descriptors: Physics, Comprehension, Test Construction, Student Surveys

Direct link
