ERIC - Search Results

Descriptor

Estimation (Mathematics)	6
Test Format	6
Test Length	6
Test Items	3
Test Reliability	3
Adaptive Testing	2
Error of Measurement	2
Ability	1
Achievement Tests	1
Classification	1
Computer Assisted Testing	1
Computer Simulation	1
Cutting Scores	1
Difficulty Level	1
English (Second Language)	1
Equations (Mathematics)	1
Error Correction	1
Evaluation Methods	1
Higher Education	1
Intelligence Tests	1
Item Response Theory	1
Language Proficiency	1
Language Tests	1
Licensing Examinations…	1
Measurement Techniques	1
More ▼

Source

Applied Measurement in…	2
Psychological Assessment	1

Author

Axelrod, Bradley N.	1
Feldt, Leonard S.	1
Henning, Grant	1
Mislevy, Robert J.	1
Qualls, Audrey L.	1
Reshetar, Rosemary A.	1
Wu, Pao-Kuei	1

Publication Type

Journal Articles	3
Reports - Evaluative	3
Reports - Research	3
Speeches/Meeting Papers	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Basic Skills	1
Test of English as a Foreign…	1
Wechsler Adult Intelligence…	1

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Estimating the Internal Consistency Reliability of Tests Composed of Testlets Varying in Length.

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 2002

Considers the degree of bias in testlet-based alpha (internal consistency reliability) through hypothetical examples and real test data from four tests of the Iowa Tests of Basic Skills. Presents a simple formula for computing a testlet-based congeneric coefficient. (SLD)

Descriptors: Estimation (Mathematics), Reliability, Statistical Bias, Test Format

Estimating the Reliability of a Test Containing Multiple Item Formats.

Peer reviewed

Qualls, Audrey L. – Applied Measurement in Education, 1995

Classically parallel, tau-equivalently parallel, and congenerically parallel models representing various degrees of part-test parallelism and their appropriateness for tests composed of multiple item formats are discussed. An appropriate reliability estimate for a test with multiple item formats is presented and illustrated. (SLD)

Descriptors: Achievement Tests, Estimation (Mathematics), Measurement Techniques, Test Format

Corrected Estimates of WAIS-R Short Form Reliability and Standard Error of Measurement.

Peer reviewed

Axelrod, Bradley N.; And Others – Psychological Assessment, 1996

The calculations of D. Schretlen, R. H. B. Benedict, and J. H. Bobholz for the reliabilities of a short form of the Wechsler Adult Intelligence Scale--Revised (WAIS-R) (1994) consistently overestimated the values. More accurate values are provided for the WAIS--R and a seven-subtest short form. (SLD)

Descriptors: Error Correction, Error of Measurement, Estimation (Mathematics), Intelligence Tests

Inferring Examinee Ability When Some Item Responses Are Missing.

Download full text

Mislevy, Robert J.; Wu, Pao-Kuei – 1988

The basic equations of item response theory provide a foundation for inferring examinees' abilities and items' operating characteristics from observed responses. In practice, though, examinees will usually not have provided a response to every available item--for reasons that may or may not have been intended by the test administrator, and that…

Descriptors: Ability, Adaptive Testing, Equations (Mathematics), Estimation (Mathematics)

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Download full text

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

An Adaptive Testing Simulation for a Certifying Examination.

Download full text

Reshetar, Rosemary A.; And Others – 1992

This study examined performance of a simulated computerized adaptive test that was designed to help direct the development of a medical recertification examination. The item pool consisted of 229 single-best-answer items from a random sample of 3,000 examinees, calibrated using the two-parameter logistic model. Examinees' responses were known. For…

Descriptors: Adaptive Testing, Classification, Computer Assisted Testing, Computer Simulation