Valente, Thomas W.; Dougherty, Leanne; Stammer, Emily – Field Methods, 2017
This study investigates potential bias that may arise when surveys include question items for which multiple units are elicited. Examples of such items include questions about experiences with multiple health centers, comparison of different products, or the solicitation of egocentric network data. The larger the number of items asked about each…
Descriptors: Foreign Countries, Interviews, Surveys, Time
Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014
Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…
Descriptors: Likert Scales, Responses, Item Response Theory, Classification
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
Descriptors: Test Bias, Item Response Theory, Test Items, Scores
Bergstrom, Betty; And Others – 1994
Response times from a computerized adaptive certification examination taken by 204 examinees were analyzed using a hierarchical linear model. Two equations were posed: a within-person model and a between-person model. Variance within persons was eight times greater than variance between persons. Several variables…
Descriptors: Adaptive Testing, Adults, Certification, Computer Assisted Testing
Catts, Ralph – 1978
The reliability of multiple-choice tests containing different numbers of response options was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items, with two distractors randomly discarded; (2) 40…
Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education
Henning, Grant – 1991
Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…
Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment

