Descriptor
| Difficulty Level | 9 |
| Estimation (Mathematics) | 9 |
| Test Format | 9 |
| Test Items | 8 |
| Test Construction | 5 |
| Adaptive Testing | 4 |
| Computer Assisted Testing | 4 |
| Equated Scores | 3 |
| Mathematical Models | 3 |
| Test Reliability | 3 |
| College Entrance Examinations | 2 |
Author
| Ackerman, Terry A. | 1 |
| Aiken, Lewis R. | 1 |
| Algina, James | 1 |
| Carlson, Alfred B. | 1 |
| Griffith, William D. | 1 |
| Henning, Grant | 1 |
| Hsu, Tse-Chi | 1 |
| Ito, Kyoko | 1 |
| Kirisci, Levent | 1 |
| Legg, Sue M. | 1 |
| Li, Yuan H. | 1 |
Publication Type
| Reports - Evaluative | 6 |
| Speeches/Meeting Papers | 5 |
| Reports - Research | 3 |
| Journal Articles | 2 |
Audience
| Researchers | 2 |
Assessments and Surveys
| ACT Assessment | 1 |
| Test of English as a Foreign… | 1 |
Li, Yuan H.; Griffith, William D.; Tam, Hak P. – 1997
This study explores the relative merits of a potentially useful item response theory (IRT) linking design: using a single set of anchor items with fixed common item parameters (FCIP) during the calibration process. An empirical study was conducted to investigate the appropriateness of this linking design using 6 groups of students taking 6 forms…
Descriptors: Ability, Difficulty Level, Equated Scores, Error of Measurement
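The fixed common item parameter (FCIP) idea behind this linking design can be illustrated with a minimal sketch: anchor-item parameters estimated on the base form are held fixed during calibration, so abilities (and any new items) estimated against them land on the base-form scale. All item parameters and responses below are hypothetical, not the study's data.

```python
import math

def p_2pl(theta, a, b):
    """Two-parameter logistic (2PL) probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical anchor items: (a, b) fixed at their base-form estimates.
anchor_items = [(1.2, -0.5), (0.8, 0.0), (1.5, 0.7)]

def mle_theta(responses, items):
    """Crude grid-search maximum-likelihood ability estimate; because the
    anchor parameters are held fixed, the estimate is on the base scale."""
    best, best_ll = 0.0, float("-inf")
    for g in range(-400, 401):          # theta grid from -4.00 to 4.00
        theta = g / 100.0
        ll = 0.0
        for u, (a, b) in zip(responses, items):
            p = p_2pl(theta, a, b)
            ll += math.log(p) if u == 1 else math.log(1.0 - p)
        if ll > best_ll:
            best, best_ll = theta, ll
    return best

theta_hat = mle_theta([1, 1, 0], anchor_items)  # examinee on base scale
```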
Sykes, Robert C.; Ito, Kyoko – 1995
Whether the presence of bidimensionality has any effect on the adaptive recalibration of test items was studied through live-data simulation of computer adaptive testing (CAT) forms. The source data were examinee responses to the 298 scored multiple choice items of a licensure examination in a health care profession. Three 75-item part-forms,…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Estimation (Mathematics)
Peer reviewed: Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests
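Classical indices of the kind such a program computes can be sketched in a few lines: difficulty as the proportion of examinees answering correctly, and discrimination as the point-biserial correlation between the item and total score. The response data below are hypothetical.

```python
import math
import statistics

def item_difficulty(responses):
    """Classical difficulty: proportion of examinees answering correctly."""
    return sum(responses) / len(responses)

def point_biserial(item, totals):
    """Discrimination: point-biserial correlation of a 0/1 item
    with total test scores."""
    p = sum(item) / len(item)
    mean_t = statistics.fmean(totals)
    sd_t = statistics.pstdev(totals)
    mean_correct = statistics.fmean(t for u, t in zip(item, totals) if u)
    return (mean_correct - mean_t) / sd_t * math.sqrt(p / (1.0 - p))

# Toy data: five examinees, one scored item, and their total test scores.
item = [1, 1, 0, 1, 0]
totals = [9, 8, 4, 7, 3]
```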
Smith, Robert L.; Carlson, Alfred B. – 1995
The feasibility of constructing test forms with practically equivalent cut scores using judges' estimates of item difficulty as target "statistical" specifications was investigated. Test forms with equivalent judgmental cut scores (based on judgments of item difficulty) were assembled using items from six operational forms of the…
Descriptors: Cutting Scores, Decision Making, Difficulty Level, Equated Scores
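One simple way to assemble a form against a target "statistical" specification is a greedy heuristic over judged item difficulties; the pool and target below are hypothetical illustrations, not the study's actual assembly procedure.

```python
def assemble_form(pool, n_items, target_mean):
    """Greedy heuristic: at each step add the pooled item that keeps the
    form's running mean difficulty closest to the target specification."""
    remaining = list(pool)
    form, total = [], 0.0
    for _ in range(n_items):
        pick = min(remaining,
                   key=lambda it: abs((total + it[1]) / (len(form) + 1)
                                      - target_mean))
        remaining.remove(pick)
        form.append(pick)
        total += pick[1]
    return form

# Hypothetical pool of (item id, judged difficulty) pairs.
pool = [("i1", 0.20), ("i2", 0.35), ("i3", 0.50), ("i4", 0.55),
        ("i5", 0.70), ("i6", 0.80), ("i7", 0.45), ("i8", 0.60)]
form = assemble_form(pool, 4, 0.50)
```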
Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.
Henning, Grant – 1993
This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…
Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)
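Of the reliability estimates mentioned, internal consistency (Cronbach's alpha) is the most direct to compute; a minimal sketch with toy 0/1 item scores (hypothetical data, not TOEFL results):

```python
import statistics

def cronbach_alpha(scores):
    """Cronbach's alpha from a matrix of item scores (rows = examinees)."""
    k = len(scores[0])
    item_vars = sum(statistics.pvariance([row[i] for row in scores])
                    for i in range(k))
    total_var = statistics.pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - item_vars / total_var)

# Toy 0/1 responses: four examinees by three items (hypothetical data).
scores = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
alpha = cronbach_alpha(scores)
```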
Ackerman, Terry A. – 1987
The purpose of this study was to investigate the effect of using multidimensional items in a computer adaptive test (CAT) setting which assumes a unidimensional item response theory (IRT) framework. Previous research has suggested that the composite of multidimensional abilities being estimated by a unidimensional IRT model is not constant…
Descriptors: Adaptive Testing, College Entrance Examinations, Computer Assisted Testing, Computer Simulation
Kirisci, Levent; Hsu, Tse-Chi – 1992
A predictive adaptive testing (PAT) strategy was developed based on statistical predictive analysis, and its feasibility was studied by comparing PAT performance to those of the Flexilevel, Bayesian modal, and expected a posteriori (EAP) strategies in a simulated environment. The proposed adaptive test is based on the idea of using item difficulty…
Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing
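The expected a posteriori (EAP) strategy used as one comparison point estimates ability as the posterior mean over a quadrature grid; a minimal sketch under a 2PL model with a standard-normal prior (item parameters hypothetical):

```python
import math

def p_2pl(theta, a, b):
    """Two-parameter logistic probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_estimate(responses, items, n_points=61):
    """EAP ability estimate: posterior mean over an evenly spaced theta
    grid, with a standard-normal prior (normalizing constants cancel)."""
    num = den = 0.0
    for k in range(n_points):
        theta = -4.0 + 8.0 * k / (n_points - 1)
        weight = math.exp(-0.5 * theta * theta)  # unnormalized N(0, 1)
        for u, (a, b) in zip(responses, items):
            p = p_2pl(theta, a, b)
            weight *= p if u == 1 else 1.0 - p
        num += theta * weight
        den += weight
    return num / den

items = [(1.0, -1.0), (1.0, 0.0), (1.0, 1.0)]  # hypothetical (a, b)
theta_eap = eap_estimate([1, 1, 0], items)
```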
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions that arise as test practitioners monitor score scales derived from latent trait theory. Large-scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
Peer reviewed: Wise, Steven L.; And Others – Journal of Educational Measurement, 1992
Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level


