Publication Date
| In 2026 | 0 |
| Since 2025 | 354 |
| Since 2022 (last 5 years) | 1463 |
| Since 2017 (last 10 years) | 3331 |
| Since 2007 (last 20 years) | 5179 |
Descriptor
| Test Reliability | 9977 |
| Test Validity | 9977 |
| Test Construction | 3323 |
| Foreign Countries | 2919 |
| Psychometrics | 1818 |
| Factor Analysis | 1672 |
| Measures (Individuals) | 1329 |
| Evaluation Methods | 954 |
| Questionnaires | 931 |
| College Students | 868 |
| Factor Structure | 848 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 297 |
| Practitioners | 226 |
| Teachers | 84 |
| Administrators | 61 |
| Policymakers | 27 |
| Counselors | 25 |
| Students | 13 |
| Parents | 9 |
| Community | 5 |
| Support Staff | 5 |
Location
| Turkey | 688 |
| China | 175 |
| Australia | 171 |
| Canada | 146 |
| Indonesia | 120 |
| Spain | 106 |
| Taiwan | 91 |
| United States | 86 |
| Germany | 83 |
| United Kingdom | 82 |
| Malaysia | 77 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewedLazarus, Mitchell – National Elementary Principal, 1975
Examines confusions that result from the use of the words "standardization,""reliability,""objectivity," and "validity" in connection with testing. (IRT)
Descriptors: Educational Testing, Elementary Secondary Education, Objective Tests, Standardized Tests
Peer reviewedFinch, A. J.; And Others – Journal of Personality Assessment, 1975
Descriptors: Comparative Analysis, Emotional Disturbances, Handicapped Children, Parents
Peer reviewedPugh, Richard C.; Brunza, J. Jay – Educational and Psychological Measurement, 1975
Descriptors: Analysis of Variance, Confidence Testing, Multiple Choice Tests, Personality
Connaughton, I. M. – Educ Res, 1969
Descriptors: Educational Testing, Essay Tests, Objective Tests, Predictive Validity
Byrd, Marquita L.; Williams, Hampton S. – 1981
These two related papers provide information on teacher attitudes toward black dialect use in the classroom and the measurement of such attitudes. The first paper reports on data from 176 administrators, counselors, teachers, and student teachers, revealing significant relationships between a teacher's definition of black dialect, attitudes toward…
Descriptors: Attitude Measures, Black Dialects, Classroom Communication, Communication Research
Stansfield, Charles – 1982
The secondary level English proficiency (SLEP) test is a group administered 150 item multiple-test of English language proficiency that includes two subscores and eight different item types. It is designed to assess a foreign student's readiness for English medium instruction at the secondary level. This paper reports on two studies which were…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Listening Comprehension
Fuchs, Lynn; And Others – 1981
A study was conducted to explore the reliability and validity of three prominent procedures used in informal reading inventories (IRIs): (1) choosing a 95% word recognition accuracy standard for determining student instructional level, (2) arbitrarily selecting a passage to represent the difficulty level of a basal reader, and (3) employing…
Descriptors: Elementary Education, Informal Reading Inventories, Reading Comprehension, Reading Instruction
Haladyna, Tom; Roid, Gale – 1981
Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…
Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction
Lambrecht, Judith J. – 1981
An aptitude test requiring 10-minutes' administration time was administered to high school students learning Forkner, Century 21, and Gregg shorthand for the purpose of determining test validity for different shorthand systems. Validity data were obtained from approximately 2000 students. Aptitude test reliability ranged from KR20=0.88 to 0.90.…
Descriptors: Academic Achievement, Aptitude Tests, Correlation, Dropout Rate
Stiggins, Richard J. – 1981
An area of current concern is that of the advantages and disadvantages of measuring writing proficiency directly via writing samples, and indirectly via objective tests. Much research has been completed documenting the correlation between direct and indirect measures. However, there had not yet been a systematic and detailed conceptual analysis…
Descriptors: Comparative Analysis, Elementary Secondary Education, Evaluation Methods, Higher Education
Metham, John – 1978
This paper reports upon the evaluation and implementation of a 30-item Likert-type rating scale for teachers to use in assessing children's behaviors within preschool classrooms. The Preschool Observation Scale (POS) was developed to evaluate programs of the Mt. Druitt Early Childhood Project, North Ryde, Australia. Items were constructed on the…
Descriptors: Achievement Need, Anxiety, Behavior Rating Scales, Child Language
Faggen, Jane – 1978
Formulas are presented for decision reliability and for classification validity for mastery/nonmastery decisions based on criterion referenced tests. Two item parameters are used: the probability of a master answering an item correctly, and the probability of a nonmaster answering an item incorrectly. The theory explores the relationships of…
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Item Analysis, Item Banks
PDF pending restorationMacPhee, David – 1980
The reliability and validity of three measures of infant temperament were compared in this study. The measures included the revised Carey (1978) Infant Temperament Questionnaire, a version of the Bayley (1969) Infant Behavior Record revised for completion by the parent, and a modified version of Buss and Plomin's (1975) EASI, an acronym standing…
Descriptors: Age Differences, Comparative Analysis, Factor Structure, Individual Differences
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Guay, Roland B. – 1980
The construct of spatial ability is discussed and it is suggested that some widely used and cited tests that are called spatial ability tests may not be valid measures of that ability. Instead, their items may be solved using mental processes that are clearly analytical and not spatial in nature. Four studies involving the analysis of subjects'…
Descriptors: Adults, Aptitude Tests, Cognitive Ability, Cognitive Processes


