Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Firmin, Michael W.; Proemmel, Elizabeth; Hwang, Chi-en – Educational Research Quarterly, 2005
Previous studies have compared the accuracy of parent, teacher, and clinician ratings of children behavior, especially in diagnostic analysis. However, many have questioned the validity of the tests and the value of each rater. While some research has found differences among raters, few had looked at samples of non-referred children. We wanted to…
Descriptors: Parent Attitudes, Teacher Attitudes, Comparative Analysis, Child Behavior
Hanley, Barbara; Tasse, Marc J.; Aman, Michael G.; Pace, Pamela – Journal of Child and Family Studies, 2003
We studied 205 low-income families, using the Family Needs Scale (FNS). Factor analysis of the FNS data resulted on a 7-factor solution with high internal consistency within the various subscales. We provide normative scores based on the factor structure of the FNS. A total of 53 parents completed the FNS on two occasions with an average of four…
Descriptors: Family Needs, Low Income, Test Reliability, Interrater Reliability
McAtee, Michelle; Carr, Edward G.; Schulte, Christine; Dunlap, Glen – Journal of Positive Behavior Interventions, 2004
Problem behavior is a primary barrier to successful community inclusion for people with developmental disabilities and therefore a major priority for intervention efforts. Recently, researchers and clinicians have begun to focus on the systematic assessment of a broad range of contextual variables that purportedly affect problem behavior. In the…
Descriptors: Mental Retardation, Developmental Disabilities, Interrater Reliability, Behavior Problems
Lee, Donghyuck; Pfeiffer, Steven I. – Journal of Psychoeducational Assessment, 2006
This study examined the reliability and validity of a Korean-translated version of the Gifted Rating Scales--School Form (GRS-S) and explored the effect of gender, rater, and grade. Data were collected from elementary schools in a metropolitan area and a midsize town in South Korea. In all, 49 elementary school teachers and 272 parents…
Descriptors: Reliability, Validity, Gifted, Rating Scales
Surface, Eric A.; Dierdorff, Erich C. – Foreign Language Annals, 2003
The reliability of the ACTFL Oral Proficiency Interview (OPI) has not been reported since ACTFL revised its speaking proficiency guidelines in 1999. Reliability data for assessments should be reported periodically to provide users with enough information to evaluate the psychometric characteristics of the assessment. This study provided the most…
Descriptors: Language Tests, Interrater Reliability, Program Effectiveness, Psychometrics
Peer reviewedBurnett, J. Dale – Educational and Psychological Measurement, 1974
The general use of the Spearman-Brown formula for calculating the reliability of parallel tests with different lengths is reviewed. The importance of the assumption that the component tests be parallel is noted and the property that parallel tests must be non-negatively correlated is derived. (Author)
Descriptors: Statistical Analysis, Test Reliability, Testing Problems
Peer reviewedKristof, Walter – Psychometrika, 1974
Descriptors: Hypothesis Testing, Statistical Bias, Test Reliability
Levine, Michael V.; Saxe, David H. – 1976
A novel use of periodic functions, called the periodic procedure, was recently introduced to make practical the use of certain physical measurement ideas in psychology. This paper reports a successful attempt to apply the periodic procedure to an important psychological measurement problem, the measurement of aptitude test item difficulties. The…
Descriptors: Aptitude Tests, Item Analysis, Test Reliability
Peer reviewedMeyer, Edward P. – Educational and Psychological Measurement, 1975
Bounds are obtained for a coefficient proposed by Kaiser as a measure of average correlation and the coefficient is given an interpretation in the context of reliability theory. It is suggested that the root-mean-square intercorrelation may be a more appropriate measure of degree of relationships among a group of variables. (Author)
Descriptors: Correlation, Matrices, Statistical Analysis, Test Reliability
Peer reviewedRubin, Donald B. – Journal of Educational Psychology, 1974
Randomization should be employed whenever possible but the use of carefully controlled nonrandomized data to estimate causal effects is a reasonable and necessary procedure in many cases. (Author/BJG)
Descriptors: Predictive Validity, Reliability, Research Design, Sampling
Boshier, Roger – Psychol Rep, 1969
Descriptors: Adult Education, Motivation, Reliability, Research
Cleary, T. Anne; Linn, Robert L. – J Educ Meas, 1969
Descriptors: Correlation, Psychometrics, Statistical Data, Test Reliability
Mascaro, Guillermo F. – Percept Mot Skills, 1969
Descriptors: Behavior, Codification, Reliability, Research Problems
Bureau of Employment Security (DOL), Washington, DC. – 1965
THIS STUDY WAS CONDUCTED TO (1) DETERMINE WHETHER DIFFERENCES IN TYPING TEST LENGTH, CONTENT, FORMAT, AND TIME LIMIT HAVE ANY EFFECT ON TEST SCORES, AND (2) COMPARE THE RELIABILITY OF 5-MINUTE TYPING TESTS WITH THAT OF 10-MINUTE TESTS. TWO EQUIVALENT FORMS OF THE U.S. EMPLOYMENT SERVICE TYPING TEST AND ONE FORM OF THE U.S. CIVIL SERVICE COMMISSION…
Descriptors: Comparative Testing, Test Reliability, Tests, Typewriting
CRONBACH, LEE J.; AND OTHERS – 1967
A MEASURING OPERATION IS A SAMPLE FROM A UNIVERSE OF ADMISSIBLE OBSERVATIONS....GENERALIZABILITY STUDIES ESTIMATE THE MAGNITUDE OF THE DISCREPENCIES LIKELY TO ARISE UNDER A GIVEN MEASURING PROCEDURE, AND PROVIDE FORMULAS FOR ESTABLISHING INTERVAL AND POINT ESTIMATES OF THE UNIVERSE SCORE. A MULTIFACET GENERALIZABILITY ANALYSIS DEPARTS IN SEVERAL…
Descriptors: Behavior, Measurement, Reliability, Statistical Analysis

Direct link
