ERIC - Search Results

Showing all 13 results

The Influence of Multidimensionality on the Graded Response Model.

Peer reviewed

De Ayala, R. J. – Applied Psychological Measurement, 1994

Previous work on the effects of dimensionality on parameter estimation for dichotomous models is extended to the graded response model. Datasets are generated that differ in the number of latent factors as well as their interdimensional association, number of test items, and sample size. (SLD)

Descriptors: Estimation (Mathematics), Item Response Theory, Maximum Likelihood Statistics, Sample Size

Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

Peer reviewed

Livingston, Samuel A.; Lewis, Charles – Journal of Educational Measurement, 1995

A method is presented for estimating the accuracy and consistency of classifications based on test scores. The reliability of the score is used to estimate effective test length in terms of discrete items. The true-score distribution is estimated by fitting a four-parameter beta model. (SLD)

Descriptors: Classification, Estimation (Mathematics), Scores, Statistical Distributions

Estimating the Reliability of a Test Containing Multiple Item Formats.

Peer reviewed

Qualls, Audrey L. – Applied Measurement in Education, 1995

Classically parallel, tau-equivalently parallel, and congenerically parallel models representing various degrees of part-test parallelism and their appropriateness for tests composed of multiple item formats are discussed. An appropriate reliability estimate for a test with multiple item formats is presented and illustrated. (SLD)

Descriptors: Achievement Tests, Estimation (Mathematics), Measurement Techniques, Test Format

A Consideration for Variable Length Adaptive Tests.

Wingersky, Marilyn S. – 1989

In a variable-length adaptive test with a stopping rule that relied on the asymptotic standard error of measurement of the examinee's estimated true score, M. S. Stocking (1987) discovered that it was sufficient to know the examinee's true score and the number of items administered to predict with some accuracy whether an examinee's true score was…

Descriptors: Adaptive Testing, Bayesian Statistics, Error of Measurement, Estimation (Mathematics)

An Investigation of Hierarchical Bayes Procedures in Item Response Theory.

Peer reviewed

Kim, Seock-Ho; And Others – Psychometrika, 1994

Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item and ability parameters through two joint and two marginal Bayesian procedures. Marginal procedures yielded smaller root mean square differences for item and ability, but results for larger sample size and test length were similar.…

Descriptors: Ability, Bayesian Statistics, Computer Simulation, Estimation (Mathematics)

Item Parameter Estimation Errors and Their Influence on Test Information Functions.

Peer reviewed

Hambleton, Ronald K.; Jones, Russell W. – Applied Measurement in Education, 1994

The impact of capitalizing on chance in item selection on the accuracy of test information functions was studied through simulation, focusing on examinee sample size in item calibration and the ratio of item bank size to test length. (SLD)

Descriptors: Computer Simulation, Estimation (Mathematics), Item Banks, Item Response Theory

Inferring Examinee Ability When Some Item Responses Are Missing.

Mislevy, Robert J.; Wu, Pao-Kuei – 1988

The basic equations of item response theory provide a foundation for inferring examinees' abilities and items' operating characteristics from observed responses. In practice, though, examinees will usually not have provided a response to every available item--for reasons that may or may not have been intended by the test administrator, and that…

Descriptors: Ability, Adaptive Testing, Equations (Mathematics), Estimation (Mathematics)

The Nominal Response Model in Computerized Adaptive Testing.

De Ayala, R. J. – 1992

One important and promising application of item response theory (IRT) is computerized adaptive testing (CAT). The implementation of a nominal response model-based CAT (NRCAT) was studied. Item pool characteristics for the NRCAT as well as the comparative performance of the NRCAT and a CAT based on the three-parameter logistic (3PL) model were…

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Monte Carlo Simulation Comparison of Two-Stage Testing and Computerized Adaptive Testing.

Kim, Haeok; Plake, Barbara S. – 1993

A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…

Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing

Estimating the Number of Examinees Who Did Not Reach the Last Item of a Section.

Wainer, Howard – 1985

It is important to estimate the number of examinees who reached a test item, because item difficulty is defined by the number who answered correctly divided by the number who reached the item. A new method is presented and compared to the previously used definition of three categories of response to an item: (1) answered; (2) omitted--a…

Descriptors: College Entrance Examinations, Difficulty Level, Estimation (Mathematics), High Schools

Bias and Information of Bayesian Adaptive Testing. Research Report 83-2.

Weiss, David J.; McBride, James R. – 1983

Monte Carlo simulation was used to investigate score bias and information characteristics of Owen's Bayesian adaptive testing strategy, and to examine possible causes of score bias. Factors investigated in three related studies included effects of item discrimination, effects of fixed vs. variable test length, and effects of an accurate prior…

Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing

Test-Retest Analyses of the Test of English as a Foreign Language. TOEFL Research Reports Report 45.

Henning, Grant – 1993

This study provides information about the total and component scores of the Test of English as a Foreign Language (TOEFL). First, the study provides comparative global and component estimates of test-retest, alternate-form, and internal-consistency reliability, controlling for sources of measurement error inherent in the examinees and the testing…

Descriptors: Difficulty Level, English (Second Language), Error of Measurement, Estimation (Mathematics)

The Selection of Test Items for Decision Making with a Computer Adaptive Test.

Spray, Judith A.; Reckase, Mark D. – 1994

The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…

Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing