Publication Date
In 2025 | 42 |
Since 2024 | 165 |
Since 2021 (last 5 years) | 588 |
Since 2016 (last 10 years) | 1225 |
Since 2006 (last 20 years) | 2731 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 70 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |

Bonheim, Helmut – International Review of Applied Linguistics in Language Teaching, 1971
Descriptors: Achievement Tests, Diagnostic Tests, Item Analysis, Language Proficiency

Ladd, Eleanor M. – Reading Teacher, 1971
Descriptors: Data Analysis, Diagnostic Teaching, Error Patterns, Item Analysis

Aiken, Lewis R. – Educational and Psychological Measurement, 1983
Each of six forms of a 10-item teacher evaluation rating scale, having two to seven response categories per form, was administered to over 100 college students. Means of item responses and item variances increased with the number of response categories. Internal consistency of total scores did not change systematically. (Author/PN)
Descriptors: College Students, Higher Education, Item Analysis, Rating Scales

Hofler, Donald B. – Reading World, 1983
Argues that, in determining if a standardized test task and a classroom performance task are the same, educators should carefully analyze the tasks in terms of the input modality (stimulus) and the output modality (response). (FL)
Descriptors: Achievement Tests, Comparative Analysis, Informal Assessment, Item Analysis

Schmitt, Neal; And Others – NASSP Bulletin, 1982
Describes the results of a two-year study to evaluate an assessment center. Results indicate that it is a valuable, valid, job-related instrument for the selection of school administrators. (Author/JM)
Descriptors: Administrator Characteristics, Administrator Evaluation, Administrator Selection, Elementary Secondary Education

Ironson, Gail H.; Subkoviak, Michael J. – Journal of Educational Measurement, 1979
Test data from two diverse culture groups were analyzed to determine the agreement among four methods of detecting item bias (transformed difficulty, discrimination differences, chi-square, and item characteristic curve). The test battery contained 155 items from six subtests: vocabulary, reading comprehension, mathematics, letter groups,…
Descriptors: Comparative Analysis, High Schools, Item Analysis, Racial Differences
Duncan, Mary Ellen – Community College Frontiers, 1980
Outlines a six-step process which can be used by faculty teams to assess the validity of criterion-referenced tests. Steps include: comparing test items with course objectives, assessing the test in terms of the domains and levels of Bloom's Taxonomy, and examining the appropriateness of various types of test questions. (JP)
Descriptors: College Faculty, Criterion Referenced Tests, Evaluation Criteria, Item Analysis

Tall, Graham – Mathematics in School, 1979
Advantages and disadvantages of a test item analysis method for producing item banks are discussed with respect to different teaching methods, curriculum, and the examination system. (MP)
Descriptors: Educational Testing, Elementary Secondary Education, Item Analysis, Item Banks

Douglass, Frazier M., IV; And Others – Educational and Psychological Measurement, 1979
Classical item analysis and Rasch latent trait analysis were applied to the responses of a sample of undergraduates to two measures concerning alcoholism. Little difference in terms of practical considerations was found between the methods. (JKS)
Descriptors: Alcoholism, Comparative Analysis, Drinking, Higher Education
Rojahn, Johannes; Tasse, Marc J.; Sturmey, Peter – American Journal on Mental Retardation, 1997
Development of the Stereotyped Behavior Scale for adolescents and adults with mental retardation is described. Use with 600 individuals resulted in refinement and a 26-item scale with an internal consistency alpha of 0.88, test-retest reliability of p=0.90, and interrater reliability of p=0.76. (DB)
Descriptors: Adolescents, Adults, Behavior Patterns, Behavior Rating Scales

Weir, Cyril J.; Porter, Don – Reading in a Foreign Language, 1994
Discusses the relevance for the valid testing of reading of the difference between a 'unitary skill' approach to reading and a multiskills approach. The article produces evidence suggesting that a distinction may be drawn between language-based skills and 'global' reading skills. (55 references) (Author/CK)
Descriptors: Foreign Countries, Higher Education, Item Analysis, Language Fluency

Bosher, Susan – Nursing Education Perspectives, 2003
Nineteen multiple-choice nursing tests containing 673 items were analyzed for test wiseness, irrelevant difficulty in stem or option, linguistic/structural bias, or cultural bias. Twenty-eight types of flaws occurred at least 10 times each. (Contains 28 references.) (SK)
Descriptors: Culture Fair Tests, Higher Education, Item Analysis, Item Bias

Bonham, L. Adrianne – Adult Education Quarterly, 1991
Examination of the construct validity of Guglielmino's Self-Directed Learning Readiness Scale points toward dislike of learning as the cause of low scores. High scores seem to result from positive attitudes toward learning in general and not specifically self-directed learning. Further studies are needed to distinguish between readiness for…
Descriptors: Adult Education, Adult Learning, Construct Validity, Educational Attitudes

Schuldberg, David – Computers in Human Behavior, 1988
Describes study that investigated the effects of computerized test administration on undergraduates' responses to the Minnesota Multiphasic Personality Inventory (MMPI), and discusses methodological considerations important in evaluating the sensitivity of personality inventories in different administration formats. Results analyze the effects of…
Descriptors: Analysis of Variance, Comparative Testing, Computer Assisted Testing, Higher Education

Jacobs, Stanley S. – Research in Higher Education, 1995
Comparison of college freshman performance on two different forms of the California Critical Thinking Skills Test (n=684, 692) found a lack of equivalence between forms and low internal consistency reliability. It is suggested that, although the test may be useful for research, it is not appropriate for decision making about individual students.…
Descriptors: College Freshmen, Comparative Analysis, Critical Thinking, Educational Research