Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Armstrong, Patrick Ian; Allison, Wyndolyn; Rounds, James – Journal of Vocational Behavior, 2008
Although commercially developed interest measures based on Holland's RIASEC types are effectively used in a variety of applied settings, these measures have somewhat limited research utility due to their length and copyright restrictions placed by the test publishers. In the present study, two sets of 8-item RIASEC scales were developed using…
Descriptors: College Students, Copyrights, Validity, Vocational Interests
Jordan, Jeremy S.; Turner, Brian A. – Measurement in Physical Education and Exercise Science, 2008
Researchers in a number of disciplines have examined the utility of single-item measures for both affective and cognitive constructs. While these authors have indicated that, under certain circumstances, the use of single-item measures is appropriate, there remains concern regarding the reliability and validity of single-item measures. This study…
Descriptors: Job Satisfaction, Test Reliability, Test Validity, Measures (Individuals)
New York State Education Department, 2014
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. The purpose of this report is to document the technical aspects of the 2013-14 NYSAA.…
Descriptors: Alternative Assessment, Educational Assessment, State Departments of Education, Student Evaluation
Peer reviewedRaju, Nambury S. – Psychometrika, 1977
Coefficient Alpha can be used to estimate the reliability of a test when the test is split into several parts. It is known that alpha can severly underestimate test reliability when the parts have an unequal number of items. A generalization of alpha is proposed to correct this defect. (Author/JKS)
Descriptors: Mathematical Models, Measurement, Test Reliability
Peer reviewedSedere, M. U.; Feldt, Leonard S. – Journal of Educational Measurement, 1977
Two new reliability coefficients have been derived for situations in which a test must be divided into parts of unequal length. This report summarizes a study of the statistical bias and the standard errors of these coefficients and compares them to Guttman's lambda coefficients and Cronbach's alpha coefficient. (Author/JKS)
Descriptors: Measurement, Statistical Bias, Test Reliability
Peer reviewedPearson, Judith E.; Long, Thomas J. – Measurement and Evaluation in Counseling and Development, 1985
Reports the test-retest reliabilities for the Schedule of Recent Experiences (item counts) and the Recent Life Changes Questionnaire and compares the two scales. Subjects (N=109) were men and women enlisted in the US military reserves. Results indicated the two questionnaires demonstrate acceptable test-retest reliability. (BH)
Descriptors: Military Personnel, Scoring, Test Reliability
Peer reviewedAtkinson, Rick Paul – Journal of Autism and Developmental Disorders, 1984
The Autism Reinforcer Checklist was found to have adequate reliability when used with 29 autistic children (preschool-early adolescence). Results indicated highest and lowest ranked reinforcers of the edible, material, social, and activity type. (CL)
Descriptors: Autism, Positive Reinforcement, Test Reliability
Peer reviewedMcGary, Barbara A.; Burns, John A. – Journal of Educational and Psychological Measurement, 1974
Descriptors: Computer Programs, Scaling, Test Reliability
Stallings, William M.; Anderson, Frances E. – J Educ Meas, 1969
Descriptors: Scoring, Test Reliability, Test Validity
Peer reviewedten Berge, Jos M. F.; And Others – Psychometrika, 1981
Several algorithms for computing the greatest lower bound to reliability or the constrained minimum-trace communality solution in factor analysis have been developed. The convergence properties of these methods are examined. A uniqueness proof for the desired solution is offered. (Author/JKS)
Descriptors: Algorithms, Factor Analysis, Test Reliability
Peer reviewedVegelius, Jan – Educational and Psychological Measurement, 1980
One argument against the G index is that, unlike phi, it is not a correlation coefficient; yet, G conforms to the Kendall and E-coefficient definitions. The G index is also equal to the Pearson product moment correlation coefficient obtained from double scoring. (Author/CP)
Descriptors: Correlation, Mathematical Formulas, Test Reliability
Peer reviewedTarter, Ralph E.; And Others – Journal of Child and Adolescent Substance Abuse, 1994
Examines psychometric reliability of Drug Use Screening Inventory (DUSI) utilizing adolescents with DSM-III-R diagnosis of Psychoactive Substance Use Disorder. Concludes that split-half, internal, and test-retest reliability is superior. Suggests that DUSI may be useful for identifying and quantifying substance use and related problems. Includes…
Descriptors: Adolescents, Alcohol Abuse, Test Reliability
Heh, Peter – ProQuest LLC, 2009
The current study examined the validation and alignment of the PASA-Science by determining whether the alternate science assessment anchors linked to the regular education science anchors; whether the PASA-Science assessment items are science; whether the PASA-Science assessment items linked to the alternate science eligible content, and what…
Descriptors: Program Effectiveness, Special Education, Science Education, Science Tests
Setzer, J. Carl; He, Yi – GED Testing Service, 2009
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability
Murphy, Timothy; MacLaren, Iain; Flynn, Sharon – International Journal of Teaching and Learning in Higher Education, 2009
This study examines various aspects of an effective teaching evaluation system. In particular, reference is made to the potential of Fink's (2008) four main dimensions of teaching as a summative evaluation model for effective teaching and learning. It is argued that these dimensions can be readily accommodated in a Teaching Portfolio process. The…
Descriptors: Portfolios (Background Materials), College Faculty, Teacher Effectiveness, Summative Evaluation

Direct link
