Showing 91 to 105 of 107 results
De Champlain, Andre; Gessaroli, Marc E. – 1996
The use of indices and statistics based on nonlinear factor analysis (NLFA) has become increasingly popular as a means of assessing the dimensionality of an item response matrix. Although the indices and statistics currently available to the practitioner have been shown to be useful and accurate in many testing situations, few studies have…
Descriptors: Adaptive Testing, Chi Square, Computer Assisted Testing, Factor Analysis
Schumacker, Randall E.; And Others – 1994
Rasch between and total weighted and unweighted fit statistics were compared using varying test lengths and sample sizes. Two test lengths (20 and 50 items) and three sample sizes (150, 500, and 1,000) were crossed. Each of the six combinations was replicated 100 times. In addition, power comparisons were made. Results indicated that there were no…
Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Power (Statistics)
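The weighted (infit) and unweighted (outfit) mean-square fit statistics compared in studies like this one have standard textbook forms: each is a mean of squared standardized residuals, either unweighted or weighted by the binomial item information. A minimal sketch under the Rasch model (all variable names and the simulated data are illustrative, not taken from the paper):

```python
import math
import random

def rasch_prob(theta, b):
    """Rasch model: probability of a correct response given ability
    theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def fit_statistics(responses, thetas, b):
    """Unweighted (outfit) and information-weighted (infit) mean-square
    fit statistics for a single item."""
    z2, w = [], []
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, b)
        v = p * (1.0 - p)            # binomial variance (item information)
        z2.append((x - p) ** 2 / v)  # squared standardized residual
        w.append(v)
    outfit = sum(z2) / len(z2)                                # unweighted
    infit = sum(v * z for v, z in zip(w, z2)) / sum(w)        # weighted
    return outfit, infit

# Data simulated from the model itself should give values near 1.0.
random.seed(1)
thetas = [random.gauss(0, 1) for _ in range(2000)]
responses = [1 if random.random() < rasch_prob(t, 0.3) else 0 for t in thetas]
outfit, infit = fit_statistics(responses, thetas, 0.3)
```

Values well above 1 signal underfit (noisy responses); values well below 1 signal overfit. Replication studies of the kind described tally how these statistics vary across test lengths and sample sizes.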
Harris, Dickie A.; Penell, Roger J. – 1977
This study used a series of simulations to answer questions about the efficacy of adaptive testing raised by empirical studies. The first study showed that for reasonably high entry points, parameters estimated from paper-and-pencil test protocols cross-validated remarkably well to groups actually tested at a computer terminal. This suggested that…
Descriptors: Adaptive Testing, Computer Assisted Testing, Cost Effectiveness, Difficulty Level
Davey, Tim; Pommerich, Mary; Thompson, Tony D. – 1999
In computerized adaptive testing (CAT), new or experimental items are frequently administered alongside operational tests to gather the pretest data needed to replenish and replace item pools. The two basic strategies used to combine pretest and operational items are embedding and appending. Variable-length CATs are preferred because of the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Measurement Techniques
Peer reviewed
Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004
Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…
Descriptors: Test Length, Test Bias, Simulation, Item Response Theory
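The two dependent variables in simulation studies of this kind, Type I error and power, are both estimated the same way: run many replications and count the fraction in which the procedure flags an item, with no DIF present (Type I error) or with DIF built in (power). A generic Monte Carlo sketch, using a simple two-sample proportion z-test rather than the DIF detection methods manipulated in the paper (all parameter values are illustrative):

```python
import math
import random

def group_score(n, b, rng):
    """Number correct among n examinees on one Rasch item of difficulty b."""
    return sum(rng.random() < 1 / (1 + math.exp(-(rng.gauss(0, 1) - b)))
               for _ in range(n))

def flag_rate(n, delta, reps=1000, rng=None):
    """Fraction of replications in which a two-sided proportion z-test
    (alpha = .05) flags the item. `delta` is the difficulty shift for the
    focal group: delta = 0 estimates Type I error, delta > 0 estimates power."""
    rng = rng or random.Random(7)
    z_crit = 1.959964
    flags = 0
    for _ in range(reps):
        r, f = group_score(n, 0.0, rng), group_score(n, delta, rng)
        p_pool = (r + f) / (2 * n)
        se = math.sqrt(p_pool * (1 - p_pool) * 2 / n)
        if se > 0 and abs(r - f) / n / se > z_crit:
            flags += 1
    return flags / reps

type1 = flag_rate(500, 0.0)  # should hover near the nominal .05
power = flag_rate(500, 0.5)  # rises with the magnitude of DIF
```

Crossing factors such as sample size, test length, and DIF magnitude, as the study does, amounts to repeating this tally for each cell of the design.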
Epstein, Kenneth I.; Steinheiser, Frederick H., Jr. – 1978
A multiparameter, programmable model was developed to examine the interactive influence of certain parameters on the probability of deciding that an examinee had attained a specified degree of mastery. It was applied within the simulated context of performance testing of military trainees. These parameters included: (1) the number of assumed…
Descriptors: Academic Ability, Bayesian Statistics, Cutting Scores, Hypothesis Testing
Steinheiser, Frederick H., Jr. – 1976
A computer simulation of Bayes' Theorem was conducted in order to determine the probability that an examinee was a master conditional upon his test score. The inputs were: number of mastery states assumed, test length, prior expectation of masters in the examinee population, and conditional probability of a master getting a randomly selected test…
Descriptors: Bayesian Statistics, Classification, Computer Programs, Criterion Referenced Tests
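With the inputs the abstract lists (number of mastery states, test length, prior proportion of masters, and each state's conditional probability of a correct answer), the posterior probability of mastery follows directly from Bayes' theorem. A minimal two-state sketch with binomial likelihoods (all numeric values are illustrative, not from the study):

```python
from math import comb

def posterior_mastery(score, n_items, prior_master, p_master, p_nonmaster):
    """P(master | score) via Bayes' theorem, assuming two mastery states:
    a master answers each item correctly with probability p_master, a
    non-master with probability p_nonmaster."""
    def binom_likelihood(p):
        return comb(n_items, score) * p**score * (1 - p)**(n_items - score)
    num = prior_master * binom_likelihood(p_master)
    den = num + (1 - prior_master) * binom_likelihood(p_nonmaster)
    return num / den

# 14 correct out of 20, 60% prior mastery rate; masters answer 80% of
# items correctly, non-masters 50%.
post = posterior_mastery(14, 20, 0.6, 0.8, 0.5)
```

The same machinery extends to more than two mastery states by normalizing over all of them, which is how the simulation can vary the number of states assumed.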
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics
Hambleton, Ronald K.; Cook, Linda L. – 1978
The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…
Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis
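The three models compared here are nested: the three-parameter logistic (3PL) item response function reduces to the 2PL when the pseudo-chance parameter is zero, and to the 1PL when the discrimination parameter is also fixed. A minimal sketch of the item response function (parameter values illustrative; the common 1.7 scaling constant is omitted):

```python
import math

def p_correct(theta, a=1.0, b=0.0, c=0.0):
    """Three-parameter logistic (3PL) item response function:
    discrimination a, difficulty b, pseudo-chance level c.
    The 2PL is the special case c = 0; the 1PL additionally fixes a = 1."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# At theta = b the 3PL gives (1 + c) / 2; with c = 0.2 that is 0.6,
# whereas the 1PL gives 0.5 there.
p3 = p_correct(0.0, a=1.2, b=0.0, c=0.2)
p1 = p_correct(0.0)
```

Goodness-of-fit studies like this one ask how badly the simpler special cases misfit when data are generated with varying discriminations and nonzero pseudo-chance levels.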
Kim, Seock-Ho; And Others – 1992
Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…
Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)
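Studies of this design all start from the same data-generation step: simulating binary response matrices from the two-parameter logistic model. A generic sketch of that step only (the Bayesian estimation procedures compared in the paper are not shown; parameter values are illustrative):

```python
import math
import random

def simulate_2pl(n_persons, a_params, b_params, seed=0):
    """Simulate a binary response matrix from the two-parameter logistic
    model: P(correct) = 1 / (1 + exp(-a_j * (theta_i - b_j))), with
    abilities drawn from a standard normal distribution."""
    rng = random.Random(seed)
    thetas = [rng.gauss(0, 1) for _ in range(n_persons)]
    data = []
    for theta in thetas:
        row = [1 if rng.random() < 1 / (1 + math.exp(-a * (theta - b))) else 0
               for a, b in zip(a_params, b_params)]
        data.append(row)
    return thetas, data

# 500 simulated examinees on three items of increasing difficulty.
thetas, data = simulate_2pl(500, [0.8, 1.0, 1.5], [-1.0, 0.0, 1.0])
```

Because the true item and ability parameters are known, recovery by each estimation procedure (here, the JB1 and other hierarchical Bayes variants) can be scored directly against them.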
Eignor, Daniel R.; Hambleton, Ronald K. – 1979
The purpose of the investigation was to examine relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…
Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores
Weiss, David J.; McBride, James R. – 1983
Monte Carlo simulation was used to investigate score bias and information characteristics of Owen's Bayesian adaptive testing strategy, and to examine possible causes of score bias. Factors investigated in three related studies included effects of item discrimination, effects of fixed vs. variable test length, and effects of an accurate prior…
Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
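The core of any Bayesian adaptive testing strategy is updating a posterior over ability after each item response. Owen's procedure uses a closed-form normal approximation; the sketch below instead uses a discrete grid approximation of the same update under the 2PL model, which is easier to verify (all parameter values illustrative):

```python
import math

def bayes_update(grid, prior, a, b, correct):
    """One Bayesian ability update on a discrete theta grid after a 2PL
    item response. Grid approximation for illustration only; Owen's
    strategy replaces this with a closed-form normal approximation."""
    like = [1 / (1 + math.exp(-a * (t - b))) for t in grid]
    if not correct:
        like = [1 - p for p in like]
    post = [pr * l for pr, l in zip(prior, like)]
    z = sum(post)
    return [p / z for p in post]

grid = [i / 10 for i in range(-40, 41)]       # theta in [-4, 4]
weights = [math.exp(-t * t / 2) for t in grid]  # standard normal prior
s = sum(weights)
prior = [w / s for w in weights]

# A correct answer on a b = 0 item shifts the posterior mean upward.
post = bayes_update(grid, prior, 1.0, 0.0, correct=True)
mean = sum(t * p for t, p in zip(grid, post))
```

Score bias of the kind the study investigates arises when the approximate posterior mean, iterated over an adaptively selected item sequence, systematically over- or under-shoots the true ability.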
Hambleton, Ronald K. – 1995
Performance assessments in education and credentialing are becoming popular. At the same time, there do not exist any well established and validated methods for setting standards on performance assessments. This paper describes several of the new standard-setting methods that are emerging for use with performance assessments and considers their…
Descriptors: Achievement Tests, Cutting Scores, Holistic Evaluation, Licensing Examinations (Professions)
Bejar, Isaac I. – 1985
The Test of English as a Foreign Language (TOEFL) was used in this study, which attempted to develop a new methodology for assessing the speededness of right-scored tests. Traditional procedures of assessing speededness have assumed that the test is scored under formula-scoring instructions; this approach is not always appropriate. In this study,…
Descriptors: College Entrance Examinations, English (Second Language), Estimation (Mathematics), Evaluation Methods
Brown, Joel M.; Weiss, David J. – 1977
An adaptive testing strategy is described for achievement tests covering multiple content areas. The strategy combines adaptive item selection both within and between the subtests in the multiple-subtest battery. A real-data simulation was conducted to compare the results from adaptive testing and from conventional testing, in terms of test…
Descriptors: Achievement Tests, Adaptive Testing, Branching, Comparative Analysis