Publication Date
| In 2026 | 0 |
| Since 2025 | 55 |
| Since 2022 (last 5 years) | 197 |
| Since 2017 (last 10 years) | 497 |
| Since 2007 (last 20 years) | 745 |
Descriptor
| Test Items | 1189 |
| Test Reliability | 1189 |
| Test Validity | 687 |
| Test Construction | 567 |
| Foreign Countries | 349 |
| Difficulty Level | 280 |
| Item Analysis | 253 |
| Psychometrics | 236 |
| Item Response Theory | 219 |
| Factor Analysis | 184 |
| Multiple Choice Tests | 173 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 69 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedKarnes, Frances A.; Brown, K. Eliot – Psychology in the Schools, 1981
A study to develop a short form of the Wechsler Intelligence Scale for Children-Revised (WISC-R) for the intellectually gifted showed the Vocabulary and Block Design comprise the best two-subtest short form. The Similarities, Vocabulary, Block Design, and Object Assembly tetrad could be most useful in time and reliability. (Author)
Descriptors: Academically Gifted, Elementary Secondary Education, Intelligence Tests, Screening Tests
Peer reviewedRusch, Reuben; Steiner, Judith – Journal of Experimental Education, 1979
The Selected Marker Tests were examined for scoring problems and internal consistency and were administered orally to sixth and seventh graders. Scoring problems were discovered and changes were suggested. The problem was found to be item reliability rather than interrater reliability. (Author/MH)
Descriptors: Cognitive Tests, Elementary Education, Item Analysis, Problem Solving
Peer reviewedWainer, Howard; Lukhele, Robert – Educational and Psychological Measurement, 1997
The reliability of scores from four forms of the Test of English as a Foreign Language (TOEFL) was estimated using a hybrid item response theory model. It was found that there was very little difference between overall reliability when the testlet items were assumed to be independent and when their dependence was modeled. (Author/SLD)
Descriptors: English (Second Language), Item Response Theory, Scores, Second Language Learning
Peer reviewedEngelhard, George, Jr.; And Others – Journal of Research and Development in Education, 1990
Results are reported from a study that investigated the correspondence between two methods used to assess differential item functioning (test item bias). The study also explored the influence of sample size on the two procedures. Although agreement between the two procedures was generally good, the Rasch procedure was more reliable. (IAH)
Descriptors: Comparative Analysis, Elementary Secondary Education, Item Bias, Racial Differences
Peer reviewedWatson, Jane M. – Journal of Educational Psychology, 1988
The Achievement Anxiety Test's dimensionality was assessed using data from 378 university students. Analyses suggest the viability of a unidimensional construct, whose ability to provide extreme subject groups showing differences on other characteristics of academic achievement was assessed. Such a scale has potential for separating…
Descriptors: Academic Achievement, College Students, Factor Analysis, Higher Education
Peer reviewedAntonak, Richard F.; Harth, Robert – Mental Retardation, 1994
Psychometric analyses of data from 230 individuals yielded a 29-item 4-scale revision of the original 50-item 5-scale Mental Retardation Attitude Inventory. Results showed adequate item characteristics; adequate reliability and homogeneity; adequate reliability, homogeneity, specificity, and independence of the four scales; and initial validity…
Descriptors: Attitude Measures, Attitudes toward Disabilities, Mental Retardation, Psychometrics
Peer reviewedDavidson, Fred – System, 2000
Statistical analysis tools in language testing are described, chiefly classical test theory and item response theory. Computer software for statistical analysis is briefly reviewed and divided into three tiers: commonly available; statistical packages; and specialty software. (Author/VWL)
Descriptors: Computer Software, Language Tests, Second Language Learning, Statistical Analysis
Feldt, Leonard S. – Educational and Psychological Measurement, 2005
To meet the requirements of the No Child Left Behind Act, school districts and states must compile summary reports of the levels of student achievement in reading and mathematics. The levels are to be described in broad categories: "basic and below," "proficient," or "advanced." Educational units are given considerable latitude in defining the…
Descriptors: Federal Legislation, Academic Achievement, Test Items, Test Validity
McCane, Sara Jean – Journal of Psychoeducational Assessment, 2006
The Motor-Free Visual Perception Test: Third edition (MVPT-3; Colarusso & Hammill, 2003) purports to measure overall visual perceptual ability. Task responses require no motor ability, eliminating the effect of motor performance on the overall visual perception score. The test authors suggested that this MVPT-3 characteristic allows for its…
Descriptors: Visual Perception, Perception Tests, Test Reviews, Psychomotor Skills
Erford, Bradley T. – Assessment for Effective Intervention, 2004
Technical characteristics of the Reading Essential Skill Screener--Preschool Version (RESS-P) were studied using four independent samples of boys and girls aged 3-5 years. A decision efficiency study (N = 91) resulted in a total predictive value (TPV) of .85 when compared with the criterion of teacher report/judgment of emerging literacy at-risk…
Descriptors: Early Reading, Test Validity, Test Reliability, Item Analysis
Erford, Bradley T.; Stephens, Vicki M. – Measurement and Evaluation in Counseling and Development, 2005
Technical characteristics of the Reading Essential Skills Screener-Elementary Version (RESS-E; B. T. Erford, G. Vitali, R. Haas, & R. R. Boykin, 1995) were studied using 4 independent samples of boys and girls between the ages of 6 and 8 years. Evidence of internal consistency, test-retest reliability, decision efficiency, factorial validity,…
Descriptors: Validity, Test Reliability, Reading Skills, Screening Tests
Fortunato, Vincent J.; LeBourgeois, Monique K.; Harsh, John – Educational and Psychological Measurement, 2008
This article describes the development of a measure of adult sleep quality: the Adult Sleep-Wake Scale (ADSWS). The ADSWS is a self-report pencil-and-paper measure of sleep quality consisting of five behavioral dimensions (Going to Bed, Falling Asleep, Maintaining Sleep, Reinitiating Sleep, and Returning to Wakefulness). Data were collected from…
Descriptors: Construct Validity, Test Validity, Sleep, Personality Traits
Boldt, R. F. – 1992
The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…
Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency
Budescu, David V.; And Others – 1994
Modified Parallel Analysis (MPA) is a heuristic method for assessing "approximate unidimensionality" of item pools. It compares the second eigenvalue of the observed correlation matrix with the corresponding eigenvalue extracted from a "parallel" matrix generated by a unidimensional and locally independent model. Revised…
Descriptors: Equations (Mathematics), Heuristics, Item Analysis, Item Banks
Dorans, Neil J.; Zeller, Karin – ETS Research Report Series, 2004
In the Spring 2003 issue of "Harvard Educational Review," Roy Freedle stated that the SAT® is both culturally and statistically biased, and he proposed a solution to ameliorate this bias. His claims, which garnered national attention, were based on serious errors in his analysis. We begin our analyses by assessing the psychometric…
Descriptors: Test Bias, Statistical Bias, Psychometrics, College Entrance Examinations

Direct link
