Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Kim, Seock-Ho; Cohen, Allan S. – 1996
Applications of item response theory to practical testing problems including equating, differential item functioning, and computerized adaptive testing, require that item parameter estimates be placed onto a common metric. In this study, three methods for developing a common metric under item response theory are compared: (1) linking separate…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Difficulty Level
Nissan, Susan; And Others – 1996
One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…
Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)
Lietz, Petra H.; Roche, Lawrence A. – 1996
This study investigates whether or not the factor structure of reading comprehension is invariant across large, nationally representative samples of 14-year-old students from four different countries. The data from French-speaking Belgium, Hungary, Italy, and the United States were collected as part of the Reading Literacy Study of 1990-91,…
Descriptors: Adolescents, Correlation, Databases, Factor Analysis
Experimental Study of the Effects of Calculator Use on the Advanced Placement Calculus Examinations.
Morgan, Rick; Stevens, Joe – 1991
Advanced Placement Calculus examinations were administered to nearly 7,000 students in order to determine the impact of calculator use. Both experimental examinations had two sections. Section I items were designed so that a calculator was not needed, but approximately half of the students were permitted to use calculators. Section II items…
Descriptors: Advanced Placement, Calculators, Calculus, College Bound Students
Kennedy, Rob – 1994
The purpose of this study was to compare the scores of students who were allowed unlimited retakes of a multiple-choice test with the scores of students who were limited to only four retakes (five trials) of the same test. The tests were each made up of 20 randomly drawn questions from a large pool of questions about research methods. Three…
Descriptors: Comparative Analysis, Graduate Students, Graduate Study, Higher Education
Hambleton, Ronald; Rodgers, Jane – 1995
This digest introduces three issues to consider when evaluating test items for bias: fairness, bias, and stereotyping. In any bias investigation, the first step is to identify the subgroups of interest. Bias reviews and studies generally focus on differential performance for sex, ethnic, cultural, and religious groups. In preparing an item bias…
Descriptors: Cultural Differences, Culture Fair Tests, Ethnicity, Evaluation Methods
Lam, Peter; Foong, Yoke-Yeen – 1996
An important principle in constructing rating scales is to develop items that reflect various degrees of the "pro" (positive) and "contra" (negative) aspects of the trait being measured. Where both positive and negative items are pooled, they can be arranged in order along the trait continuum, but for classical and item…
Descriptors: Attitude Measures, Foreign Countries, Internship Programs, Item Response Theory
O'Neill, Thomas R.; Lunz, Mary E. – 1996
To generalize test results beyond the particular test administration, an examinee's ability estimate must be independent of the particular items attempted, and the item difficulty calibrations must be independent of the particular sample of people attempting the items. This stability is a key concept of the Rasch model, a latent trait model of…
Descriptors: Ability, Benchmarking, Comparative Analysis, Difficulty Level
Wilson, Kenneth M. – 1989
Possible population differences in speed versus level of Graduate Record Examinations (GRE) reading comprehension scores were explored. The study used operational measures computed post hoc from item-level data in GRE files for a pre-October 1977 version of the verbal test in which 40 GRE reading comprehension (RC) items were included as a…
Descriptors: Correlation, English, Ethnicity, Graduate Study
Barnette, J. Jackson – 1995
When self-administered surveys or questionnaires are administered, there is often a question of whether respondents have attended to the items in a thoughtful manner. This paper examines the results of three different surveys to investigate the occurrence of nonattending behaviors such as missing items, patterns that indicate lack of…
Descriptors: Attention, Behavior Patterns, Educational Research, Elementary School Students
Singley, Mark K.; Bennett, Randy Elliot – 1995
One of the main limitations of the current generation of computer-based tests is its dependency on the multiple-choice item. This research was aimed at extending computer-based testing by bringing limited forms of performance assessment to it in the domain of mathematics. This endeavor involves not only building task types that better reflect…
Descriptors: Computer Assisted Testing, Item Analysis, Mathematics Tests, Multiple Choice Tests
Braun, Henry I.; And Others – 1989
The use of constructed response items in large scale standardized testing has been hampered by the costs and difficulties associated with obtaining reliable scores. The advent of expert systems may signal the eventual removal of this impediment. This study investigated the accuracy with which expert systems could score a new, non-multiple choice…
Descriptors: Computer Science, Constructed Response, Expert Systems, High School Seniors
Ryan, Katherine E.; Chiu, Shuwan – 1996
The use of scientific calculators on standardized mathematics tests is becoming more common, and how their use affects the standardized testing process continues to be a topic of study. This study extends previous work by examining test item characteristics and equity issues, asking whether items designed to be "calculator neutral"…
Descriptors: Calculators, College Students, Content Analysis, Equal Education
Swaak, Janine; And Others – 1997
A study was conducted to develop a test that is able to capture knowledge of an intuitive nature, such as that acquired through discovery learning. The proposed test format is called the "what-if test." Test items in this format consist of the presentation of a situation. A change in the situation is introduced, and learners have to…
Descriptors: College Students, Discovery Learning, Educational Assessment, Evaluation Methods
California State Univ., Los Angeles. National Dissemination and Assessment Center. – 1982
The booklet is part of a grade 10-12 social studies series produced for bilingual education. The series consists of six major thematic modules, with four to five booklets in each. The interdisciplinary modules are based on major ideas and designed to help students understand some major human problems and make sound, responsive decisions to improve…
Descriptors: Behavioral Objectives, Bilingual Instructional Materials, Decision Making, Learning Modules


