ERIC - Search Results

Source

Educational Measurement:…	2
Educational and Psychological…	1

Author

Wainer, Howard	8
Kiely, Gerard L.	1
Lukhele, Robert	1
Thissen, David	1

Publication Type

Reports - Evaluative	5
Journal Articles	3
Reports - Research	2
Guides - Non-Classroom	1
Information Analyses	1
Reports - Descriptive	1

Location

Canada	1
Israel	1

Assessments and Surveys

Advanced Placement…	1
Test of English as a Foreign…	1

Showing all 8 results

Comparing the Incomparable: An Essay on the Importance of Big Assumptions and Scant Evidence.

Peer reviewed

Wainer, Howard – Educational Measurement: Issues and Practice, 1999

Discusses the comparison of groups of individuals who were administered different forms of a test. Focuses on the situation in which there is little overlap in content between the test forms. Reviews equating problems in national tests in Canada and Israel. (SLD)

Descriptors: Comparative Analysis, Equated Scores, Foreign Countries, National Competency Tests

How Reliable Are TOEFL Scores?

Peer reviewed

Wainer, Howard; Lukhele, Robert – Educational and Psychological Measurement, 1997

The reliability of scores from four forms of the Test of English as a Foreign Language (TOEFL) was estimated using a hybrid item response theory model. It was found that there was very little difference between overall reliability when the testlet items were assumed to be independent and when their dependence was modeled. (Author/SLD)

Descriptors: English (Second Language), Item Response Theory, Scores, Second Language Learning

Some Practical Considerations when Converting a Linearly Administered Test to an Adaptive Format.

Peer reviewed

Wainer, Howard – Educational Measurement: Issues and Practice, 1993

Some cautions are sounded for converting a linearly administered test to an adaptive format. Four areas are identified in which practices broadly used in traditionally constructed tests can have adverse effects if thoughtlessly adopted when a test is administered in an adaptive mode. (SLD)

Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Practices, Test Construction

Some Empirical Guidelines for Building Testlets. Program Statistics Research Technical Report No. 91-14.

Wainer, Howard; And Others – 1991

A series of computer simulations was run to measure the relationship between testlet validity and the factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Results confirmed the generality of earlier empirical findings of H. Wainer and others (1991) that making a testlet adaptive yields only marginal…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Item Banks

How Well Can We Equate Test Forms That Are Constructed by Examinees? Program Statistics Research.

Wainer, Howard; And Others – 1991

When an examination consists, in whole or in part, of constructed response items, it is a common practice to allow the examinee to choose among a variety of questions. This procedure is usually adopted so that the limited number of items that can be completed in the allotted time does not unfairly affect the examinee. This results in the de facto…

Descriptors: Adaptive Testing, Chemistry, Comparative Analysis, Computer Assisted Testing

On Examinee Choice in Educational Testing. GRE Board Professional Report No. 91-17P.

Wainer, Howard; Thissen, David – 1994

When an examination consists in whole or part of constructed response test items, it is common practice to allow the examinee to choose a subset of the constructed response questions from a larger pool. It is sometimes argued that, if choice were not allowed, the limitations on domain coverage forced by the small number of items might unfairly…

Descriptors: Constructed Response, Difficulty Level, Educational Testing, Equated Scores

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

CATs, Testlets, and Test Construction: A Rationale for Putting Test Developers Back into CAT.

Wainer, Howard; Kiely, Gerard L. – 1986

Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns are principally involved with the concept of having the computer construct the test from a precalibrated item pool, and substituting statistical characteristics for the test developer's skills. Problems with…

Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity

Descriptor

Test Format	8
Test Items	8
Adaptive Testing	5
Computer Assisted Testing	5
Test Construction	5
Comparative Analysis	3
Difficulty Level	3
Equated Scores	3
Item Banks	3
Test Length	3
Test Validity	3
Constructed Response	2
Item Response Theory	2
Models	2
Testing Problems	2
Algebra	1
Algorithms	1
Chemistry	1
Computer Simulation	1
Construct Validity	1
Educational Assessment	1
Educational Practices	1
Educational Testing	1
English (Second Language)	1
Foreign Countries	1