Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedGinther, Joan R. – Journal for Research in Mathematics Education, 1978
Pretraining on test-taking strategies improved the reliability and predictive power of the Arithmetic Reasoning Test for seventh-grade Chicano students but not for non-Chicano students. Pretraining did not improve the reliability of the Missing Words Test for either group, but appears to have improved the predictive power of this test for both…
Descriptors: Aptitude Tests, Educational Research, Grade 7, Mathematics Education
Ibe, Milagros D. – RELC Journal, 1975
This investigation seeks to determine: the validity and reliability of cloze tests for measuring reading comprehension; the relation of cloze scores to difficulty levels of reading passages; the relation of cloze test performance to length of English training; and the merits of judgmental vs. random word deletion in test construction. (DB)
Descriptors: Cloze Procedure, English (Second Language), Indochinese, Language Teachers
Peer reviewedAleamoni, Lawrence M.; Eitelbach, Sarah B. – Research in Higher Education, 1976
A comparison of two forms of the College Entrance Examination Board's (CEEB) English Composition Test with four rhetoric final examinations in a basic English composition course indicated that the CEEB was more stable and yielded better item statistics while departmental examinations were more highly related to course grade. (Editor/JT)
Descriptors: Criterion Referenced Tests, Educational Research, English Instruction, Grade Prediction
Peer reviewedReker, Gary T. – Journal of Clinical Psychology, 1977
This research assessed the reliability and validity of the Purpose in Life test in an inmate population, investigated the relationship between the PIL test and attitudes, locus of control, personality factors, and several demographic variables, and compared the PIL scores of inmates with scores of normal samples. (Author/RK)
Descriptors: Demography, Individual Characteristics, Locus of Control, Measurement Instruments
Peer reviewedSchmidt, Frank L.; And Others – Personnel Psychology, 1977
The adverse impact of a content-valid job sample test of metal trades skills was compared to that of a well-constructed content-valid written achievement test for the same technical area. The adverse impact of the former was considerably less. Suggests that industrial psychologists should explore more fully the potential of performance testing.…
Descriptors: Majority Attitudes, Minority Groups, Performance Tests, Research Design
An Analysis of the Higher School Certificate and University Performances of Early Admission Entrants
Watkins, David – Vestes, 1977
Subsequent academic performances of students admitted to the University of New England on the basis of a Principals' Report during 1972 and 1973 are examined. The validity of the Principals' Report as a method assessing suitability for early admission is upheld. (LBH)
Descriptors: Academic Achievement, Admission Criteria, Comparative Analysis, Early Admission
Peer reviewedTaber, Gary Davisson; Pfister, Guenter G. – Unterrichtspraxis, 1977
This article examines what types of language tests are suitable in evaluating instructional programs in testing for educational accountability. Norm-referenced measurements are found unsuitable for this purpose; new, criterion-referenced tests must be developed. (CHK)
Descriptors: Accountability, Achievement Tests, Criterion Referenced Tests, Language Instruction
Peer reviewedNorcinin, John J.; And Others – Journal of Medical Education, 1987
A study of the correlation between certification test results and ratings of clinical competence for graduate medical students in internal medicine during a six-year period found strong correlations on both individual and general indicators of competence. (MSE)
Descriptors: Certification, Competence, Graduate Medical Students, Higher Education
Peer reviewedKeith, Timothy Z. – School Psychology Review, 1987
This article highlights three common problems with assessment research, including the following needs: (1) to adopt a hypothesis testing approach; (2) for research to be guided by available formal and informal theory, and (3) for research to be consistent with general school psychological practice. New research methodologies and new research…
Descriptors: Educational Assessment, Educational Psychology, Educational Research, Elementary Secondary Education
Peer reviewedWoodburn, Mary Stuart – Reading Teacher, 1986
Concludes that the test has a well-designed reading booklet and a carefully constructed manual, but that it has a narrow applicability. (FL)
Descriptors: Elementary Secondary Education, Oral Reading, Reading Achievement, Reading Diagnosis
Peer reviewedDyson, Alice T.; Robinson, Thomas W. – Language, Speech, and Hearing Services in Schools, 1987
Speech samples from five phonologically disordered children, aged 3:5 to 6:5, were evaluated using the following instruments: Assessment of Phonological Processes, Natural Process Analysis, and Procedures for the Phonological Analysis of Children's Language (modified). Potential first remediation targets were generated, and, in general, the…
Descriptors: Articulation Impairments, Early Childhood Education, Error Analysis (Language), Evaluation Methods
Peer reviewedAleamoni, Lawrence M. – New Directions for Teaching and Learning, 1987
Eight of the most common faculty concerns about student evaluations of instruction are discussed: inconsistent student judgments, the perception that only colleagues are qualified to evaluate peers' instruction, student-rating schemes as popularity contests, unreliable and invalid student-rating forms, etc. Research shows that faculty concerns are…
Descriptors: College Faculty, College Instruction, Educational Research, Faculty Evaluation
Peer reviewedBraden, Jeffery P. – Journal of School Psychology, 1987
Showed that the standard score difference method for determining intelligence quotient achievement discrepencies produced disproportionate racial representation, whereas the regression method produced proportionate racial representation in Learning Disabilities (LD) classes. Demonstrated advantages in measurement of discrepancies, LD program…
Descriptors: Achievement Rating, Children, Comparative Analysis, Disability Identification
Peer reviewedFrary, Robert B. – Journal of Educational Measurement, 1985
Responses to a sample test were simulated for examinees under free-response and multiple-choice formats. Test score sets were correlated with randomly generated sets of unit-normal measures. The extent of superiority of free response tests was sufficiently small so that other considerations might justifiably dictate format choice. (Author/DWH)
Descriptors: Comparative Analysis, Computer Simulation, Essay Tests, Guessing (Tests)
Wnuk, Lu – Canadian Journal for Exceptional Children, 1987
Two language ability tests are discussed: the Receptive-Expressive Emergent Language Scale (REEL), measuring language competencies from birth through 36 months and the Test for Auditory Comprehension of Language (TACL), for children ages three to seven. Their efficacy in identifying language delays and in establishing criteria for effective…
Descriptors: Early Childhood Education, Educational Diagnosis, Handicap Identification, Language Acquisition


