Publication Date
In 2025 | 42 |
Since 2024 | 165 |
Since 2021 (last 5 years) | 588 |
Since 2016 (last 10 years) | 1225 |
Since 2006 (last 20 years) | 2731 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 70 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |

Schmeiser, Cynthia B.; Ferguson, Richard L. – Educational Horizons, 1979
Test materials in English and social studies, reflecting Black and White culture, were developed and administered to Black and White high school students. Analyses of variance indicated that performance by either group was not affected by Black or White culture content. White examinees scored higher on all tests. (SJL)
Descriptors: Academic Achievement, Black Students, Content Analysis, Cultural Influences

Kirsch, Irwin S. – Journal of Educational Measurement, 1980
The construct validity of reading comprehension test items was studied in a two-stage process. Five characteristics of task difficulty were defined and a heterogeneous set of 52 items were rated for these characteristics. Then correlations were obtained between ratings and item difficulty data. (CTM)
Descriptors: Adults, Cognitive Processes, Difficulty Level, Evaluation Criteria

Donlon, Thomas F.; And Others – Applied Psychological Measurement, 1980
The scope and nature of sex differences in the Graduate Record Examination are explored by identifying individual test items that differ from the other items in terms of the magnitude of the difference in item difficulty for the sexes. In general, limited evidence of differences was established. (Author/CTM)
Descriptors: Aptitude Tests, College Entrance Examinations, Graduate Students, Higher Education

Hansen, Chris J.; Zytowski, Donald G. – Educational and Psychological Measurement, 1979
A measure of the extent to which the Kuder Occupational Interest Survey (KOIS) was predictive of occupational membership for an individual was correlated with KOIS item and scale scores. Results indicated that the KOIS was a moderator of its own predictive validity. (Author/JKS)
Descriptors: Females, Followup Studies, Interest Inventories, Item Analysis
Green, Douglas W.; Hering, Jeffrey – Creative Computing, 1979
A computer program for analyzing test results is presented. Item analysis and the point biserial correlation coefficient, both generated by this program, are explained and discussed. (MK)
Descriptors: Computer Programs, Computers, Educational Testing, Evaluation

Gottfried, Gail M. – Journal of Child Language, 1997
Investigates comprehension of metaphoric compounds. The study asked children and adults to identify referents of various types of these compounds. Findings reveal that 5-year-olds outperformed 3-year-olds but performed significantly less well than adults and that 3-year-olds did not generally interpret the compounds literally. (37 references)…
Descriptors: Adults, Child Language, Cognitive Development, College Students

Kolstad, Rosemarie K.; Kolstad, Robert A. – Journal of Dental Education, 1991
A study evaluated the use of a "none-of-these" option on multiple-choice achievement tests in undergraduate dental education. Results indicated this option neither enhanced nor diminished examinee performance stability but did reduce the examinee's opportunity to select correct choices by means unrelated to course objectives, thereby enhancing…
Descriptors: Achievement Tests, Dental Schools, Difficulty Level, Higher Education

Johnson, Sylvia T.; Wallace, Michael B. – Journal of Educational Measurement, 1989
The effects were assessed of a test coaching program for urban Black youth, who intended to take the Scholastic Aptitude Test (SAT), on their performance on quantitative items. Findings for 116 program participants are discussed in relation to the improvement of supplemental instructional programs. (TJH)
Descriptors: Black Students, College Entrance Examinations, High School Students, Item Analysis

Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests

Gothberg, Helen M.; Aleamoni, Lawrence M. – Journal of Education for Library and Information Science, 1988
Describes an objective test used as the comprehensive examination in a graduate library school and discusses its advantages over essay tests. The topics covered include test construction, the use of item analysis for scoring and test revision, and student reactions to the objective test. (1 reference) (CLB)
Descriptors: Case Studies, Graduate Study, Graduation Requirements, Higher Education

Miller, M. David; Linn, Robert L. – Journal of Educational Measurement, 1988
Effects of instructional coverage variations on item characteristic functions were examined, using eighth-grade data from the Second International Mathematics Study (1985). Although some differences in item response curves were large, better performance was not necessarily related to greater learning opportunity. Item curve response differences…
Descriptors: Achievement Tests, Black Students, Elementary Education, Grade 8

Brown, James Dean – Language Testing, 1988
The reliability and validity of a cloze procedure used as an English-as-a-second-language (ESL) test in China were improved by applying traditional item analysis and selection techniques. The 'best' test items were chosen on the basis of item facility and discrimination indices, and were administered as a 'tailored cloze.' 29 references listed.…
Descriptors: Adaptive Testing, Cloze Procedure, English (Second Language), Foreign Countries

Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1994
The reliabilities of 2-, 3-, 4-, and 5-choice tests were compared through an incremental-option model on a test taken by 154 high school seniors. Creating the test forms incrementally more closely approximates actual test construction. The nonsignificant differences among the option choices support the three-option item. (SLD)
Descriptors: Distractors (Tests), Estimation (Mathematics), High School Students, High Schools

Ruffalo, Stacey L.; Elliott, Stephen N. – School Psychology Review, 1997
Describes assessment of children's social behavior using Social Skills Rating System (SSRS) and an item analysis protocol (IAP). Examines relationships among mother and father pairs, teacher SSRS ratings, and descriptions of kindergarten through 4th-grade students. Finds mothers' and fathers' SSRS frequency and importance ratings for their…
Descriptors: Behavior Rating Scales, Children, Elementary School Students, Evaluation

Rifkin, Benjamin; Roberts, Felicia D. – Language Learning, 1995
Examines error gravity research design and its theoretical assumptions. Results indicate that investigators have only skimmed the surface of the process of error evaluation, which is shaped by extralinguistic factors. The article concludes that researchers should reconceptualize error gravity research and reassess earlier studies to confirm or…
Descriptors: Context Effect, Discourse Analysis, Error Analysis (Language), Error Patterns