Baumgartner, Ted A. – Research Quarterly, 1978
The author describes a valid, reliable, and inexpensive modified pull-up test capable of discriminating among subjects of low strength and endurance. (MJB)
Descriptors: Equipment, Low Achievement, Measurement Techniques, Muscular Strength
Peer reviewed: Kratochwill, Thomas R.; Brody, Gene H. – Journal of Consulting and Clinical Psychology, 1976
Subjects were randomly assigned to one of three groups: standard WAIS administration; a praise condition with praise for each correct WAIS response; and a self-monitoring condition with direct feedback on response accuracy. Results indicated that specific feedback is effective in inducing IQ test performance change in normal adults. (NG)
Descriptors: Behavior Change, College Students, Feedback, Intelligence Tests
Peer reviewed: Davis, William A. – Journal of the American College Health Association, 1977
Descriptors: Athletes, Cardiovascular System, Physical Examinations, Test Interpretation
Peer reviewed: Rowley, Glenn L.; Traub, Ross E. – Journal of Educational Measurement, 1977
The consequences of formula scoring versus number right scoring are examined in relation to the assumptions commonly made about the behavior of examinees in testing situations. The choice between the two is shown to be dependent upon having reduced error variance or unbiasedness as a goal. (Author/JKS)
Descriptors: Error of Measurement, Scoring Formulas, Statistical Bias, Test Wiseness
Peer reviewed: Marco, Gary L. – Journal of Educational Measurement, 1977
This paper summarizes three studies that illustrate how application of the three-parameter logistic test model helped solve three relatively intractable testing problems. The three problems are: designing a multi-purpose test, evaluating a multi-level test, and equating a test on the basis of pretest statistics. (Author/JKS)
Descriptors: Latent Trait Theory, Measurement, Models, Pretests Posttests
Peer reviewed: Burket, George R. – Journal of Educational Measurement, 1987
This response to the Baglin paper (1986) points out the fallacy in inferring that inappropriate scaling procedures cause apparent discrepancies between medians and means and between means calculated using different units. (LMO)
Descriptors: Norm Referenced Tests, Scaling, Scoring, Statistical Distributions
Peer reviewed: Prior, Margot; And Others – International Journal of Behavioral Development, 1987
This study was designed to (1) gather Australian data on the Toddler Temperament Scale (TTS); (2) assess age differences in temperament in the one- to three-year-old group; (3) assess the psychometric properties of the TTS; and (4) consider some issues of concurrent validity in the measurement of temperament and behavioral adjustment. (Author)
Descriptors: Foreign Countries, Personality Traits, Rating Scales, Research Problems
Peer reviewed: Cowart, Virginia S. – Physician and Sportsmedicine, 1988
A description of the problems that occurred with attempts to conduct drug tests at the 1987 Pan American games leads to a discussion of the legal challenges to drug testing and the need to establish a clear, effective, and fair policy for drug tests of athletes. (CB)
Descriptors: Athletes, Athletics, Drug Abuse, Illegal Drug Use
Peer reviewed: Fredericks, Anthony D. – Reading Teacher, 1987
Offers a humorous look at the problem of assessment. (FL)
Descriptors: Elementary Education, Humor, Reading Instruction, Reading Tests
Peer reviewed: Koenke, Karl – Reading Teacher, 1987
Looks at ERIC documents that deal with readability concerns. (FL)
Descriptors: Elementary Education, Readability, Readability Formulas, Reading Instruction
Peer reviewed: Harrington, Robert G.; And Others – Psychology in the Schools, 1985
Evaluated interscorer reliability of the Spatial Memory subtest, which appears on the Simultaneous Processing scale of the Kaufman Assessment Battery for Children. Responses from 19 gifted children were scored by two independent examiners. Results showed this subtest may be prone to scoring errors because no permanent record of responses exists.…
Descriptors: Elementary Education, Gifted, Interrater Reliability, Preadolescents
Peer reviewed: Lively, Mary Ann – Language, Speech, and Hearing Services in Schools, 1984
Common problems in using and scoring the Developmental Sentence Scoring (DSS) procedure to quantify the grammatical structure of young children's expressive language are reviewed. Scoring examples are provided to help clinicians learn the DSS procedure. (Author/CL)
Descriptors: Expressive Language, Language Handicaps, Language Tests, Scores
Peer reviewed: Parisi, Marinella; Sias, M. Assunta – Human Development, 1985
Hypothesizes that children may misunderstand the task required by Piaget's test and that researchers may therefore underestimate the children's cognitive capacities. Tests the hypothesis by dividing 48 children of both sexes into two groups, those taking the standard tests and those taking a test restructured to limit ambiguity. (BE)
Descriptors: Ambiguity, Conservation (Concept), Preschool Children, Test Construction
Peer reviewed: Newcomer, Phyllis L. – Remedial and Special Education (RASE), 1985
A study of the extent to which two popular Published Reading Inventories (PRIs) identify the same instructional level when administered to 50 children in grades 1 through 7 demonstrates a significant lack of congruence between the instruments, particularly at the intermediate grade levels. (Author/CL)
Descriptors: Elementary Education, Informal Reading Inventories, Reading Difficulties, Test Validity
Peer reviewed: McCauley, Rebecca J.; Swisher, Linda – Journal of Speech and Hearing Disorders, 1984
The paper discusses concepts fundamental to the proper use of norm-referenced tests, considers common errors in their use, and suggests alternatives to norm-referenced testing for certain assessment purposes. A hypothetical client is used to illustrate errors, including the use of age-equivalent scores as the sole summary of test results. (Author/CL)
Descriptors: Disabilities, Elementary Secondary Education, Norm Referenced Tests, Student Evaluation


