Publication Date
| Date Range | Results |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 3 |
Descriptor
| Descriptor | Results |
| --- | --- |
| Responses | 9 |
| Test Length | 9 |
| Test Items | 6 |
| Item Response Theory | 4 |
| Ability | 3 |
| Test Construction | 3 |
| Adaptive Testing | 2 |
| Adults | 2 |
| Comparative Analysis | 2 |
| Correlation | 2 |
| Difficulty Level | 2 |
Author
| Author | Results |
| --- | --- |
| Bergstrom, Betty | 1 |
| Catts, Ralph | 1 |
| Dougherty, Leanne | 1 |
| Henning, Grant | 1 |
| Lee, Jihyun | 1 |
| Lee, Yi-Hsuan | 1 |
| Mislevy, Robert J. | 1 |
| Nandakumar, Ratna | 1 |
| Paek, Insu | 1 |
| Stammer, Emily | 1 |
| Valente, Thomas W. | 1 |
Publication Type
| Publication Type | Results |
| --- | --- |
| Reports - Research | 6 |
| Journal Articles | 3 |
| Reports - Evaluative | 3 |
| Speeches/Meeting Papers | 2 |
| Numerical/Quantitative Data | 1 |
| Tests/Questionnaires | 1 |
Assessments and Surveys
| Assessment | Results |
| --- | --- |
| Test of English as a Foreign Language (TOEFL) | 2 |
Valente, Thomas W.; Dougherty, Leanne; Stammer, Emily – Field Methods, 2017
This study investigates potential bias that may arise when surveys include question items for which multiple units are elicited. Examples of such items include questions about experiences with multiple health centers, comparison of different products, or the solicitation of egocentric network data. The larger the number of items asked about each…
Descriptors: Foreign Countries, Interviews, Surveys, Time
Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014
Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…
Descriptors: Likert Scales, Responses, Item Response Theory, Classification
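The Lee and Paek entry above turns on generating Likert-type data from the IRT Graded Response Model with different numbers of response categories. Below is a minimal, hypothetical sketch of that kind of simulation in Python; it is not the authors' code, and the item parameters, sample size, and the 4- versus 7-category comparison are illustrative assumptions only.

```python
# Hedged sketch: simulate Likert-type responses under the Graded Response Model
# and compare how coarser vs. finer category systems track the latent trait.
import numpy as np

rng = np.random.default_rng(0)

def simulate_grm(theta, a, thresholds):
    """Draw one item's responses under the Graded Response Model.

    theta      : (n_persons,) latent abilities
    a          : item discrimination
    thresholds : ordered category boundaries b_1 < ... < b_{K-1}
    Returns integer responses in 0..K-1.
    """
    # P(X >= k) for k = 1..K-1, via 2PL-style boundary curves
    p_ge = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - thresholds[None, :])))
    # Category probabilities are differences of adjacent boundary curves
    p_ge = np.hstack([np.ones((theta.size, 1)), p_ge, np.zeros((theta.size, 1))])
    probs = p_ge[:, :-1] - p_ge[:, 1:]
    cum = probs.cumsum(axis=1)
    u = rng.random((theta.size, 1))
    return (u > cum).sum(axis=1)

theta = rng.normal(size=1000)
# Illustrative comparison: a 4-category and a 7-category version of the same item
resp4 = simulate_grm(theta, a=1.2, thresholds=np.array([-1.0, 0.0, 1.0]))
resp7 = simulate_grm(theta, a=1.2, thresholds=np.linspace(-1.5, 1.5, 6))
print(np.corrcoef(theta, resp4)[0, 1], np.corrcoef(theta, resp7)[0, 1])
```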
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
Descriptors: Test Bias, Item Response Theory, Test Items, Scores
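The Lee and Zhang entry describes manipulating DIF-related factors in simulated data and tracking the effect on scores. The snippet below is a rough, hypothetical illustration of that setup under a 2PL model, not the ETS study's design; the 10% DIF rate, 0.5 difficulty shift, and all other values are invented for demonstration.

```python
# Hedged sketch: inject difficulty-shift DIF into a fraction of 2PL items and
# compare total scores for reference and focal groups of equal ability.
import numpy as np

rng = np.random.default_rng(1)
n_items, n_persons, pct_dif, dif_shift = 40, 2000, 0.10, 0.5

a = rng.lognormal(0.0, 0.3, n_items)      # discriminations (illustrative)
b = rng.normal(0.0, 1.0, n_items)         # difficulties (illustrative)
theta = rng.normal(size=n_persons)        # same ability distribution for both groups

def total_scores(b_items):
    # Probability correct under the 2PL, then draw dichotomous responses
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b_items)))
    return (rng.random((n_persons, n_items)) < p).sum(axis=1)

b_dif = b.copy()
dif_items = rng.choice(n_items, int(pct_dif * n_items), replace=False)
b_dif[dif_items] += dif_shift             # DIF items are harder for the focal group

print("reference group mean total score:", total_scores(b).mean())
print("focal group mean total score:    ", total_scores(b_dif).mean())
```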
Yamamoto, Kentaro – 1995
Missing responses, the traditional indicator of test speededness, clearly signal a lack of time to respond, but they are inadequate for evaluating speededness in a multiple-choice test scored as number correct, and they underestimate it. Conventional item response theory (IRT) parameter…
Descriptors: Ability, Estimation (Mathematics), Item Response Theory, Multiple Choice Tests
Mislevy, Robert J.; Wu, Pao-Kuei – 1988
The basic equations of item response theory provide a foundation for inferring examinees' abilities and items' operating characteristics from observed responses. In practice, though, examinees will usually not have provided a response to every available item--for reasons that may or may not have been intended by the test administrator, and that…
Descriptors: Ability, Adaptive Testing, Equations (Mathematics), Estimation (Mathematics)
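The Mislevy and Wu entry concerns how IRT ability inference should handle items with no response. The sketch below, assuming a 2PL model with illustrative parameters, shows the simplest possible choice, dropping unanswered items from the likelihood as if they were ignorable; the paper's point is precisely that this choice is not always justified, so treat this as a baseline illustration rather than the paper's method.

```python
# Hedged sketch: maximum-likelihood ability estimation under a 2PL model,
# using only the items the examinee actually answered (NaN = no response).
import numpy as np
from scipy.optimize import minimize_scalar

a = np.array([1.0, 1.4, 0.8, 1.2, 1.1])   # discriminations (illustrative)
b = np.array([-0.5, 0.0, 0.3, 0.8, 1.5])  # difficulties (illustrative)
x = np.array([1, 0, 1, np.nan, np.nan])   # responses; last two items not answered

def neg_log_lik(theta):
    answered = ~np.isnan(x)
    p = 1.0 / (1.0 + np.exp(-a[answered] * (theta - b[answered])))
    r = x[answered]
    return -np.sum(r * np.log(p) + (1 - r) * np.log(1 - p))

theta_hat = minimize_scalar(neg_log_lik, bounds=(-4, 4), method="bounded").x
print("ML ability estimate from answered items only:", round(theta_hat, 3))
```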
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics
Bergstrom, Betty; And Others – 1994
Response times from 204 examinees who took a computerized adaptive certification examination were analyzed using a hierarchical linear model. Two equations were posed: a within-person model and a between-person model. Variance within persons was eight times greater than variance between persons. Several variables…
Descriptors: Adaptive Testing, Adults, Certification, Computer Assisted Testing
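The Bergstrom et al. entry fits a hierarchical linear model to response times, partitioning variance within and between examinees. The following is a hedged sketch of such a two-level random-intercept analysis using statsmodels on simulated data; the variable names, effect sizes, and model formula are placeholders, not the study's specification.

```python
# Hedged sketch: item response times nested within examinees, fit with a
# random-intercept model so within- and between-person variance can be compared.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(2)
n_examinees, n_items = 200, 30

person_effect = rng.normal(0.0, 0.3, n_examinees)   # between-person spread (illustrative)
difficulty = rng.normal(0.0, 1.0, n_items)

rows = []
for p in range(n_examinees):
    for i in range(n_items):
        log_rt = 3.0 + person_effect[p] + 0.2 * difficulty[i] + rng.normal(0.0, 0.8)
        rows.append({"examinee": p, "difficulty": difficulty[i], "log_rt": log_rt})
data = pd.DataFrame(rows)

model = smf.mixedlm("log_rt ~ difficulty", data, groups=data["examinee"]).fit()
between_var = float(model.cov_re.iloc[0, 0])   # variance of the person intercepts
within_var = model.scale                       # residual (within-person) variance
print("within/between variance ratio:", within_var / between_var)
```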
Catts, Ralph – 1978
The reliability of multiple-choice tests containing different numbers of response options was investigated for 260 students enrolled in technical college economics courses. Four test forms, constructed from previously used four-option items, were administered, consisting of (1) 60 two-option items, with two distractors randomly discarded; (2) 40…
Descriptors: Answer Sheets, Difficulty Level, Foreign Countries, Higher Education
Henning, Grant – 1991
Criticisms of the Test of English as a Foreign Language (TOEFL) have included speculation that the listening test places too much burden on short-term memory as compared with comprehension, that a knowledge of reading is required to respond successfully, and that many items appear to require mere recall and matching rather than higher-order…
Descriptors: Adults, Auditory Stimuli, Cognitive Processes, Educational Assessment

