Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 10 |
| Since 2017 (last 10 years) | 312 |
| Since 2007 (last 20 years) | 639 |
Descriptor
| Statistical Analysis | 1074 |
| Test Reliability | 1074 |
| Test Validity | 613 |
| Foreign Countries | 362 |
| Factor Analysis | 307 |
| Test Construction | 297 |
| Correlation | 251 |
| Psychometrics | 176 |
| Questionnaires | 155 |
| Scores | 147 |
| College Students | 119 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 8 |
| Brennan, Robert L. | 6 |
| Irvin, P. Shawn | 6 |
| Lai, Cheng-Fei | 6 |
| Livingston, Samuel A. | 6 |
| Park, Bitnara Jasmine | 6 |
| Tindal, Gerald | 6 |
| Feldt, Leonard S. | 4 |
| Harris, Chester W. | 4 |
| Huynh, Huynh | 4 |
| Lembke, Erica S. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 14 |
| Practitioners | 9 |
| Students | 3 |
| Teachers | 3 |
Location
| Turkey | 97 |
| California | 16 |
| Germany | 16 |
| Australia | 15 |
| China | 14 |
| Iran | 14 |
| Jordan | 14 |
| United Kingdom | 13 |
| Canada | 12 |
| Malaysia | 10 |
| Spain | 9 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 2 |
| Individuals with Disabilities… | 2 |
| Individuals with Disabilities… | 2 |
| No Child Left Behind Act 2001 | 2 |
| Safe and Drug Free Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedKorner, Anneliese F.; And Others – Child Development, 1987
A neurobehavioral maturity assessment which used data from preterm infants was developed. Eight dimensions of neurobehavioral functioning were found to be stable, nonredundant, and developmentally valid. (PCB)
Descriptors: Individual Development, Infant Behavior, Measures (Individuals), Neonates
Peer reviewedKingma, Johannes; Reuvekamp, Johan – Educational and Psychological Measurement, 1984
The relationship between seriation tasks and number line comprehension tasks is reported. Types of tasks administered were six seriation tasks derived from Piaget, six seriation tasks provided with irrelevant cues, and number line comprehension tasks. Stochastic Mokken scale analysis indicated selection of six original seriation tasks formed a…
Descriptors: Developmental Stages, Elementary School Mathematics, Foreign Countries, Measurement Techniques
Peer reviewedBriere, Eugene J.; Brown, Richard H. – TESOL Quarterly, 1971
Revised version of a paper presented at the TESOL Convention in New Orleans, Louisiana, March 1971. Project funded by the Bureau of Indian Affairs. (VM)
Descriptors: American Indians, English (Second Language), Language Instruction, Language Skills
Peer reviewedBruvold, William H.; And Others – Educational and Psychological Measurement, 1983
Initial assessments of the item characteristics, reliability, and validity of the Inquiry Mode Questionnaire (IMQ) are reported. The IMQ was developed by (Harrison and Bramson) to measure the characteristic thinking style of individuals on five major dimensions: synthesist, idealist, analyst, realist, and pragmatist. (Author/PN)
Descriptors: Cognitive Style, Correlation, Factor Structure, Item Analysis
Peer reviewedTalmage, Harriet; Rasher, Sue Pinzur – Journal of Nutrition Education, 1981
Provides a brief overview of validity and reliability as concepts related to the overall quality of test instruments. Describes the nature and interpretation of content, face, criterion, and construct validity and identifies several approaches for measurement and improvement of reliability. (Author/CS)
Descriptors: Elementary Secondary Education, Higher Education, Resource Materials, Science Education
Peer reviewedAnd Others; Liu, Philip – Journal of Medical Education, 1980
A method of statistically analyzing clinical performance examinations for reliability and the application of this method in determining the reliability of two examinations of skill in administering anesthesia are described. Videotaped performances for the Spinal Anesthesia Skill Examination and the Anesthesia Setup and Machine Checkout Examination…
Descriptors: Clinical Experience, Higher Education, Medical Education, Performance
Peer reviewedHuynh, Huynh; Saunders, Joseph C. – Journal of Educational Measurement, 1980
Single administration (beta-binomial) estimates for the raw agreement index p and the corrected-for-chance kappa index in mastery testing are compared with those based on two test administrations in terms of estimation bias and sampling variability. Bias is about 2.5 percent for p and 10 percent for kappa. (Author/RL)
Descriptors: Comparative Analysis, Error of Measurement, Mastery Tests, Mathematical Models
Peer reviewedSilverstein, A.B.; And Others – Psychology in the Schools, 1980
Internal consistency, alternate form, and stability coefficients for the CAK-C were obtained for a sample of educable mentally retarded children. The reliability of the CAK-C is judged to be satisfactory for EMR children. (Author)
Descriptors: Handicapped Children, Mental Retardation, Mild Mental Retardation, Performance Factors
Wainer, Howard; And Others – 1991
It is sometimes sensible to think of the fundamental unit of test construction as being larger than an individual item. This unit, dubbed the testlet, must pass muster in the same way that items do. One criterion of a good item is the absence of differential item functioning (DIF). The item must function in the same way as all important…
Descriptors: Definitions, Identification, Item Bias, Item Response Theory
Houser, Ronald L.; And Others – 1983
This report describes a procedure that promises to improve the stability, accuracy, and efficiency of the employment of latent trait models and an application of the procedure to the Rasch model. Data were collected from the Portland Public Schools Level Tests administered to 25,740 students. Since each of the 173 items (chosen from the total…
Descriptors: Academic Achievement, Educational Testing, Item Banks, Latent Trait Theory
Levine, Michael V. – 1976
It is shown that empirical mental test P - P plots are approximately equal to theoretical item-item curves, at least for long tests administered to many people. This result is important because it leads to (1) a distribution free method for estimating points on item-item curves; (2) a general method for defining estimates of item parameters; and…
Descriptors: Item Analysis, Latent Trait Theory, Mathematical Applications, Mathematical Models
COX, RICHARD C. – 1965
THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…
Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis
NUSS, EUGENE M.; ROOKEY, ERNEST J. – 1966
THIS STUDY WAS PHASE 1 OF A 2-PHASE PROJECT DESIGNED TO CHANGE TEACHER BEHAVIOR WITH THE OBJECTIVE OF IMPROVED CLASSROOM UTILIZATION OF NEW INSTRUCTIONAL MEDIA. PHASE 1 TRIED TO INCREASE TEACHER KNOWLEDGE OF MEDIA VIA A MONTHLY NEWSLETTER, "THE CIRCULATOR," DISTRIBUTED IN 5 DISSEMINATION PATTERNS TO 2,200 EDUCATORS IN PENNSYLVANIA. A…
Descriptors: Audiovisual Instruction, Behavior Change, Educational Media, Experiments
PDF pending restorationLovett, Hubert T. – 1975
The reliability of a criterion referenced test was defined as a measure of the degree to which the test discriminates between an individual's level of performance and a predetermined criterion level. The variances of observed and true scores were defined as the squared deviation of the score from the criterion. Based on these definitions and the…
Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Mathematical Models
Andrulis, Richard S.; And Others – 1974
The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…
Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)


