Peer reviewed. Peng, Chao-Ying, J.; Subkoviak, Michael J. – Journal of Educational Measurement, 1980
Huynh (1976) suggested a method of approximating the reliability coefficient of a mastery test. The present study examines the accuracy of Huynh's approximation and also describes a computationally simpler approximation which appears to be generally more accurate than the former. (Author/RL)
Descriptors: Error of Measurement, Mastery Tests, Mathematical Models, Statistical Analysis
Subkoviak, Michael J. – 1976
A number of different definitions and indices of reliability for mastery tests have recently been proposed in an attempt to cope with possible lack of score variability that attenuates traditional coefficients. One promising index that has been suggested is the proportion of students in a group that are consistently assigned to the same mastery…
Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Scores
Peer reviewed. Ault, Ruth L.; And Others – Child Development, 1976
Two statistical characteristics of the Matching Familiar Figures test which produce methodological problems in reflection-impulsivity research are discussed. (BRT)
Descriptors: Conceptual Tempo, Elementary Education, Research Methodology, Research Problems
Peer reviewed. Joe, George W.; Woodward, J. Arthur – Psychometrika, 1976
This article is concerned with estimation of components of maximum generalizability in multifacet experimental designs involving multiple dependent measures. An example of a two-facet partially nested design is provided. (Author/RC)
Descriptors: Analysis of Variance, Correlation, Matrices, Reliability
O'Shea, Arthur J.; Harrington, Thomas F. – Measurement and Evaluation in Guidance, 1980
Describes the procedures the authors of the System for Career Decision-Making (CDM) followed in establishing client scoring reliability. Authors recommend that manuals of self-scored inventories provide data establishing scorer reliability, that scoring be supervised, and that APGA test standards deal directly with scorer reliability. (Author)
Descriptors: Career Choice, College Students, Decision Making, Interest Inventories
Peer reviewed. Charter, Richard A.; Feldt, Leonard S. – Measurement and Evaluation in Counseling and Development, 2002
Presented is a detailed description of two true score confidence interval approaches, their use, interpretation, and a philosophical conflict that arises in many applied instances. (Contains 27 references.) (Author)
Descriptors: Error of Measurement, Psychometrics, Research Methodology, Statistical Analysis
Peer reviewed. Webster, Raymond E. – Psychology in the Schools, 1988
Examined temporal stability of Wechsler Intelligence Scale for Children-Revised (WISC-R) for adolescents (N=155) identified as either educable mentally retarded (EMR) or learning disabled (LD). Found three major scales of WISC-R to be more stable over three-year period for LD than for EMR group, while subtest scales for EMR group showed greater…
Descriptors: Adolescents, Learning Disabilities, Mild Mental Retardation, Special Education
Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008
The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring
Massa, Jacqueline; Gomes, Hilary; Tartter, Vivien; Wolfson, Virginia; Halperin, Jeffrey M. – International Journal of Language & Communication Disorders, 2008
Background: Research has shown that early identification of children with language issues is critical for effective intervention, and yet many children are not identified until school age. The use of parent-completed rating scales, especially in urban, minority populations, might improve early identification if parent ratings are found to be…
Descriptors: Speech Communication, Language Impairments, Reading Tests, Validity
Brown, James Dean; Ross, Jacqueline A. – 1993
This study investigates the Test of English as a Foreign Language (TOEFL), in particular the relative contributions to score dependability (analogous to classical theory reliability) of various numbers of items and subtests as well as the decision dependability at different cut points. Research questions that apply to the overall TOEFL battery and…
Descriptors: English (Second Language), Language Tests, Statistical Analysis, Test Reliability
Peer reviewed. Sabatino, David A.; And Others – Exceptional Children, 1974
Test-retest reliability and validity of the Developmental Test of Visual Perception (DTVP) were investigated with 129 kindergarten children, along with changes in factorial relationships among the five DTVP subtests after 10 weeks of use of the program of M. Frostig. (MC)
Descriptors: Exceptional Child Research, Kindergarten Children, Learning Disabilities, Statistical Analysis
Peer reviewed. Schutz, Howard G.; Rucker, Margaret H. – Educational and Psychological Measurement, 1975
Data from 2-, 3-, 6-, and 7-point rating scales were analyzed to determine whether scale length affected response patterns. Results indicate that data configurations are relatively invariant with changes in number of scale points. (Author)
Descriptors: Data Collection, Factor Analysis, Questionnaires, Rating Scales
Johnson, Marion Lee – Res Quart AAHPER, 1969
Based on a dissertation for the EdD degree at the University of Texas (1966).
Descriptors: Attitude Measures, Models, Research, Research Methodology
Berk, Ronald A. – 1980
Seventeen statistics for measuring the reliability of criterion-referenced tests were critically reviewed. The review was organized into two sections: (1) a discussion of preliminary considerations to provide a foundation for choosing the appropriate category of "reliability" (threshold loss function, squared-error loss-function, or…
Descriptors: Criterion Referenced Tests, Cutting Scores, Scoring Formulas, Statistical Analysis
Starkweather, Elizabeth K. – 1970
The Starkweather Social Conformity Test is a research instrument designed to measure conforming and nonconforming behavior by providing the young child with opportunities to make choices in a situation in which he can follow a model or respond freely according to his own preferences. The test discriminates between compulsive conformists or…
Descriptors: Age Differences, Conformity, Preschool Education, Preschool Tests

