Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized test and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although the items were carefully selected, calibration data show that Samejima's graded response model does not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing

Bell, Richard; Lumsden, James – Applied Psychological Measurement, 1980
The effect of test length on predictive validity is examined empirically. For four tests, the curve of validity against test length had a very gentle slope for the longer tests and all tests could be reduced by more than 60 percent without appreciable decreases in validity. (Author/BW)
Descriptors: Foreign Countries, High School Seniors, High Schools, Mathematical Models

Birenbaum, Menucha; And Others – Applied Psychological Measurement, 1992
The effect of multiple-choice (MC) or open-ended (OE) response format on diagnostic assessment of algebra test performance was investigated with 231 eighth and ninth graders in Tel Aviv (Israel) using bug or rule space analysis. Both analyses indicated closer similarity between parallel OE subsets than between stem-equivalent OE and MC subsets.…
Descriptors: Algebra, Comparative Testing, Educational Assessment, Educational Diagnosis

Budescu, David V. – Applied Psychological Measurement, 1988
A multiple matching test--a 24-item Hebrew vocabulary test--was examined, in which distractors from several items are pooled into one list at the test's end. Construction of such tests was feasible. Reliability, validity, and reduction of random guessing were satisfactory when applied to data from 717 applicants to Israeli universities. (SLD)
Descriptors: College Applicants, Feasibility Studies, Foreign Countries, Guessing (Tests)

Budgell, Glen R.; And Others – Applied Psychological Measurement, 1995
The usefulness of three item response theory-based methods and the Mantel-Haenszel technique in evaluating the measurement equivalence of translated assessment instruments was demonstrated in a study involving 2,000 French-speaking Canadian adults who took a French translation of the test and 2,000 English-speaking adults who took the English original.…
Descriptors: Adults, Chi Square, Cultural Awareness, Culture Fair Tests