Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Tomsic, Margie L.; And Others – 1987
Extended caution indices (ECI) specify the degree of confidence that can be placed in an individual's test score by analyzing patterns of item response. Among the most promising of such indices are the standardized ECIs. Contrary to the literature, several instances were found, in a previous study, of nonnormal distributions of ECIs with samples…
Descriptors: Achievement Tests, Elementary Education, Goodness of Fit, Latent Trait Theory
Wilcox, Rand R. – 1981
These studies in test adequacy focus on two problems: procedures for estimating reliability, and techniques for identifying ineffective distractors. Fourteen papers are presented on recent advances in measuring achievement (a response to Molenaar); "an extension of the Dirichlet-multinomial model that allows true score and guessing to be…
Descriptors: Achievement Tests, Criterion Referenced Tests, Guessing (Tests), Mathematical Models
Ironson, Gail H.; Craig, Robert – 1982
This study was designed to increase knowledge of the functioning of item bias techniques in detecting biased items. Previous studies have used computer-generated data or real data with unknown amounts of bias. The present project extends previous studies by using items that are logically generated and subjectively evaluated a priori to be biased…
Descriptors: Ability Grouping, Difficulty Level, Higher Education, Item Analysis
Blair, Mark W.; And Others – 1978
The Tredyffrin/Easttown Sex Fairness Survey was designed to assess sex fairness in attitudes toward both sexes in work, home, educational, social, and other contexts. Each subscale contains positively and negatively worded Likert-type items of three types: trait to group; role to group; and judgment of individuals based on group. A preliminary…
Descriptors: Adolescents, Adults, Attitude Measures, Females
Ree, Malcolm James – 1978
The computer can assist test construction in the following four ways: (1) storage or banking of test items; (2) banking of item attributes; (3) test construction; and (4) test printing. Automated Item Banking (AIB) is a computerized item storage and test construction system which illustrates these capabilities. It was developed, implemented, and…
Descriptors: Aptitude Tests, Computer Assisted Testing, Computers, Higher Education
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to develop a plan to compare models for test equating. The five alternative models to be considered for equating are: (1) anchor test equating using…
Descriptors: Equated Scores, Error of Measurement, Latent Trait Theory, Mathematical Models
Leary, Mark R.; And Others – 1980
Since its appearance in 1974, the Snyder Self-Monitoring Scale has been employed in research dealing with self-presentation, attribution, and attitude expression. The Scale was developed to measure the degree to which people are concerned with the social appropriateness of their behavior, are aware of relevant social cues, and regulate their…
Descriptors: Adults, Attribution Theory, Behavior Rating Scales, Factor Analysis
Jensen, Harald E.; And Others – 1976
The Department of Defense directed that the military services move toward the use of a common aptitude battery for satisfaction of their personnel selection requirements. The content of existing service classifications were studied during the initial phase of battery redesign. Elements from the Army Classification Inventory, the Navy Vocational…
Descriptors: Aptitude Tests, Difficulty Level, High School Students, Military Personnel
Benson, Jeri – 1979
Two methods of item selection were used to select sets of 40 items from a 50-item verbal analogies test, and the resulting item sets were compared for relative efficiency. The BICAL program was used to select the 40 items having the best mean square fit to the one parameter logistic (Rasch) model. The LOGIST program was used to select the 40 items…
Descriptors: Comparative Analysis, Computer Programs, Costs, Efficiency
Thompson, Bruce; Levitov, Justin E. – Collegiate Microcomputer, 1985
Discusses features of a microcomputer program, SCOREIT, used at New Orleans' Loyola University and several high schools to score and analyze test results. Benefits and dimensions of the program's automated test and item analysis are outlined, and several examples illustrating test and item analyses by SCOREIT are presented. (MBR)
Descriptors: Computer Assisted Testing, Computer Software, Difficulty Level, Higher Education
McCowan, Richard J.; McCowan, Sheila C. – Online Submission, 1999
This paper describes major concepts related to item analysis for criterion-referenced tests including validity, reliability, item difficulty, and item discrimination, particularly in relation to criterion-referenced tests. The paper discussed how these concepts can be used to revise and improve items and listed suggestions regarding general…
Descriptors: Criterion Referenced Tests, Standard Setting, Item Analysis, Item Response Theory
Peer reviewedFrisbie, David A.; Sweeney, Daryl C. – Journal of Educational Measurement, 1982
A 100-item five-choice multiple choice (MC) biology final exam was converted to multiple choice true-false (MTF) form to yield two content-parallel test forms comprised of the two item types. Students found the MTF items easier and preferred MTF over MC; the MTF subtests were more reliable. (Author/GK)
Descriptors: Biology, College Science, Comparative Analysis, Difficulty Level
Peer reviewedGoh, David S. – Applied Psychological Measurement, 1979
The advantages of using psychometric thoery to design short forms of intelligence tests are demonstrated by comparing such usage to a systematic random procedure that has previously been used. The Wechsler Intelligence Scale for Children Revised (WISC-R) Short Form is presented as an example. (JKS)
Descriptors: Elementary Secondary Education, Intelligence Tests, Item Analysis, Psychometrics
Peer reviewedLoyd, Brenda H. – Applied Measurement in Education, 1990
Four mathematics test-item types that may perform differently when calculators are used were assessed using data from 160 high school students attending a summer enrichment program. The effects of testing with and without calculators on testing time, test reliability, item difficulty, and item discrimination were also assessed. (TJH)
Descriptors: Calculators, Difficulty Level, High School Students, High Schools
Peer reviewedSchriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991
Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)
Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)


