ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Responses	6
Test Length	6
Test Items	4
Test Construction	3
Adults	2
Comparative Analysis	2
Difficulty Level	2
Foreign Countries	2
Item Response Theory	2
Reaction Time	2
Test Reliability	2
Adaptive Testing	1
Answer Sheets	1
Auditory Stimuli	1
Bias	1
Certification	1
Classification	1
Cognitive Processes	1
Computer Assisted Testing	1
Correlation	1
Data Collection	1
Educational Assessment	1
English (Second Language)	1
Higher Education	1
Interviews	1
More ▼

Source

ETS Research Report Series	1
Field Methods	1
Journal of Psychoeducational…	1

Author

Bergstrom, Betty	1
Catts, Ralph	1
Dougherty, Leanne	1
Henning, Grant	1
Lee, Jihyun	1
Lee, Yi-Hsuan	1
Paek, Insu	1
Stammer, Emily	1
Valente, Thomas W.	1
Zhang, Jinming	1

Publication Type

Reports - Research	6
Journal Articles	3
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Audience

Location

Australia	1
Ghana	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Response Bias over Time: Interviewer Learning and Missing Data in Egocentric Network Surveys

Peer reviewed

Direct link

Valente, Thomas W.; Dougherty, Leanne; Stammer, Emily – Field Methods, 2017

This study investigates potential bias that may arise when surveys include question items for which multiple units are elicited. Examples of such items include questions about experiences with multiple health centers, comparison of different products, or the solicitation of egocentric network data. The larger the number of items asked about each…

Descriptors: Foreign Countries, Interviews, Surveys, Time

In Search of the Optimal Number of Response Categories in a Rating Scale

Peer reviewed

Direct link

Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014

Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…

Descriptors: Likert Scales, Responses, Item Response Theory, Classification

Differential Item Functioning: Its Consequences. Research Report. ETS RR-10-01

Peer reviewed
PDF on ERIC

Download full text

Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010

This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…

Descriptors: Test Bias, Item Response Theory, Test Items, Scores

Computerized Adaptive Testing Exploring Examinee Response Time Using Hierarchical Linear Modeling.

Download full text

Bergstrom, Betty; And Others – 1994

Examinee response times from a computerized adaptive test taken by 204 examinees taking a certification examination were analyzed using a hierarchical linear model. Two equations were posed: a within-person model and a between-person model. Variance within persons was eight times greater than variance between persons. Several variables…

Descriptors: Adaptive Testing, Adults, Certification, Computer Assisted Testing

Q. How Many Options Should a Multiple-Choice Question Have? (a) 2. (b) 3. (c) 4. At-a-glance Research Report.

Catts, Ralph – 1978

The reliability of multiple choice tests--containing different numbers of response options--was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items--two distractors randomly discarded; (2) 40…

Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education

A Study of the Effects of Variation of Short-Term Memory Load, Reading Response Length, and Processing Hierarchy on TOEFL Listening Comprehension Item Performance. Report 33.

Download full text

Henning, Grant – 1991

Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…

Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment