Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 312 |
| Since 2007 (last 20 years) | 639 |
Descriptor
| Statistical Analysis | 1074 |
| Test Reliability | 1074 |
| Test Validity | 613 |
| Foreign Countries | 362 |
| Factor Analysis | 307 |
| Test Construction | 297 |
| Correlation | 251 |
| Psychometrics | 176 |
| Questionnaires | 155 |
| Scores | 147 |
| College Students | 119 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 8 |
| Brennan, Robert L. | 6 |
| Irvin, P. Shawn | 6 |
| Lai, Cheng-Fei | 6 |
| Livingston, Samuel A. | 6 |
| Park, Bitnara Jasmine | 6 |
| Tindal, Gerald | 6 |
| Feldt, Leonard S. | 4 |
| Harris, Chester W. | 4 |
| Huynh, Huynh | 4 |
| Lembke, Erica S. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 14 |
| Practitioners | 9 |
| Students | 3 |
| Teachers | 3 |
Location
| Turkey | 97 |
| California | 16 |
| Germany | 16 |
| Australia | 15 |
| China | 14 |
| Iran | 14 |
| Jordan | 14 |
| United Kingdom | 13 |
| Canada | 12 |
| Malaysia | 10 |
| Spain | 9 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 2 |
| Individuals with Disabilities… | 2 |
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 2 |
| Safe and Drug Free Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Thomas, Ruth G.; Bruning, Charles R. – Measurement and Evaluation in Guidance, 1981
Investigated the stabilities and construct validities of the Career Salience Questionnaire (CS) and the Central Life Interests Questionnaire (CLI) after minor modifications. Results indicated the modified CS and CLI are reliable for experimental use and that the CS and CLI tap different aspects of an "importance of work" construct. (RC)
Descriptors: Career Choice, Career Development, Job Satisfaction, Measures (Individuals)
Peer reviewedMcGrew, Kevin; Murphy, Suzanne – Journal of School Psychology, 1995
Investigates the general factor and uniqueness characteristics of the individual tests of the Woodcock-Johnson Test of Cognitive Ability-Revised (WJTCA-R). Only 2 of the 19 WJTCA-R tests examined had low general factor loadings, while 2 had low uniqueness. All other tests had medium or high uniqueness. Discusses implications for clinical…
Descriptors: Academic Ability, Cognitive Ability, Intelligence, Intelligence Tests
Peer reviewedWilder, Lynn K.; Sudweeks, Richard R. – Education and Treatment of Children, 2003
This study describes and documents reliability reporting practices in dissertation studies that have used the "Behavior Assessment System for Children" (BASC). Only 9 of 106 studies examined reported reliability for subpopulations. The lack of reliability score estimates has implications for use of the BASC to identify culturally diverse…
Descriptors: Behavior Rating Scales, Cultural Differences, Doctoral Dissertations, Elementary Secondary Education
Assessing the Evidence: Different Types of NVQ Evidence and Their Impact on Reliability and Fairness
Greatorex, Jackie – Journal of Vocational Education and Training, 2005
The research literature reveals that there are many factors that influence the consistency of assessors' or examiners' judgements. One issue that has not been considered is whether National Vocational Qualifications assessors' consistency of judgement is affected by different types of evidence. In this article, 15 Customer Service and 12 Assessor…
Descriptors: Qualifications, Examiners, Interrater Reliability, Job Applicants
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – ETS Research Report Series, 2006
This study addresses the sample error and linking bias that occur with small and unrepresentative samples in a non-equivalent groups anchor test (NEAT) design. We propose a linking method called the "synthetic function," which is a weighted average of the identity function (the trivial equating function for forms that are known to be…
Descriptors: Equated Scores, Sample Size, Test Items, Statistical Bias
Wright, Daniel B.; Skagerberg, Elin M. – Psychology Teaching Review, 2006
Multiple choice questions (MCQs) are becoming more common in UK psychology departments and the need to assess their reliability is apparent. Having examined the reliability of MCQs in our department we faced many questions from colleagues about why we were examining reliability, what it was that we were doing, and what should be reported when…
Descriptors: Psychology, Foreign Countries, Student Evaluation, Evaluation Methods
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models
Lenke, Joanne M.; And Others – 1977
To investigate the effect of violating the assumption of equal item difficulty on Kuder-Richardson (KR) Formula 21 reliability coefficient, 670 eighth-and ninth- grade students were administered 26 short, homogeneous "tests" of mathematics concepts and skills. Both KR Formula 20 and KR Formula 21 were used to estimate reliability on each…
Descriptors: Comparative Analysis, Diagnostic Tests, Difficulty Level, Item Analysis
Harris, Chester W. – 1975
Achievement tests which are specifically linked to an instructional program and have been developed in relation to an objectives base and/or to an item generation rule are considered, as well as student response data. Three types of studies are outlined and the kind of procedures thought useful illustrated. As various methods for examining…
Descriptors: Achievement Tests, Instructional Programs, Item Banks, Item Sampling
Shoemaker, David M. – 1972
Described and listed herein with concomitant sample input and output is the Fortran IV program which estimates parameters and standard errors of estimate per parameters for parameters estimated through multiple matrix sampling. The specific program is an improved and expanded version of an earlier version. (Author/BJG)
Descriptors: Computer Oriented Programs, Computer Programs, Error of Measurement, Error Patterns
PDF pending restorationLarsson, Bernt – 1974
This report gives some simple examples of stability for one factor and 2 x 2 factorial analysis of variance, reliability and correlations. The findings are very different: from superstability (no transformation whatsoever can change the result) to almost total instability. This is followed by a discussion of applications to multivariate analysis,…
Descriptors: Analysis of Variance, Correlation, Discriminant Analysis, Factor Analysis
Crumpton, John – 1974
Intended for employers concerned about problem solving and communication within their organization, this document outlines a strategy for developing an instrument that would provide objective data rather than impressionistic data. A questionnaire was designed to explore the relationship between the central constructs of problem solving and…
Descriptors: Communication Problems, Communication Skills, Needs, Organizational Climate
PDF pending restorationHarris, Chester W. – 1971
Livingston's work is a careful analysis of what occurs when one pools two populations with different means, but similar variances and reliability coefficients. However, his work fails to advance reliability theory for the special case of criterion-referenced testing. See ED 042 802 for Livingston's paper. (MS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Error of Measurement, Reliability
Michigan State Dept. of Education, Lansing. – 1971
This report describes the development of the 1969-70 Michigan Educational Assessment measures used in assessing the levels and distribution of educational performance for Michigan's districts, schools, and pupils. The report has four sections. The first section contains a brief description of the 1969-70 assessment program, including a statement…
Descriptors: Achievement Tests, Attitude Measures, Educational Testing, Measurement Instruments
St. Pierre, Robert G. – Evaluation Quarterly, 1978
Data from the national evaluation of Project Follow Through were analyzed using analysis of covariance with and without correcting the pretest for unreliability. Such corrections led to some changes in conclusions. There are many disagreements in the literature about the appropriateness of correction for unreliability. (Author/CTM)
Descriptors: Analysis of Covariance, Data Analysis, Error Patterns, Pretests Posttests

Direct link
