Showing 76 to 90 of 107 results
Peer reviewed
Liou, Michelle – Applied Psychological Measurement, 1994
A recursive equation is proposed for computing higher-order derivatives of elementary symmetric functions in the Rasch model. A simulation study indicates a small loss in accuracy for the proposed formula, compared with Gustafsson's (1980) sum algorithm for computing higher-order derivatives, when tests contain 60 items or fewer. (SLD)
Descriptors: Algorithms, Computation, Item Response Theory, Simulation
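The Liou abstract concerns the recursion for elementary symmetric functions of the Rasch item parameters. As a hedged illustration only (the article's formula for higher-order derivatives is not reproduced in the abstract), the Python sketch below implements the standard summation recursion gamma_r(k) = gamma_r(k-1) + eps_k * gamma_(r-1)(k-1); the function name and the example difficulties are illustrative assumptions.

```python
import math

def elementary_symmetric(betas):
    """Elementary symmetric functions gamma_0..gamma_n of eps_i = exp(-b_i)."""
    eps = [math.exp(-b) for b in betas]
    gamma = [1.0] + [0.0] * len(eps)          # gamma_0 = 1 before any items are added
    for k, e in enumerate(eps, start=1):
        # update in reverse so gamma[r - 1] still refers to the first k - 1 items
        for r in range(k, 0, -1):
            gamma[r] += e * gamma[r - 1]
    return gamma

# illustrative item difficulties (assumed, not from the article)
print(elementary_symmetric([-1.0, 0.0, 1.0]))
```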
Peer reviewed
Direct link
Chang, Yuan-chin Ivan – Psychometrika, 2005
In this paper, we apply sequential one-sided confidence interval estimation procedures with beta-protection to adaptive mastery testing. The procedures of fixed-width and fixed proportional accuracy confidence interval estimation can be viewed as extensions of one-sided confidence interval procedures. It can be shown that the adaptive mastery…
Descriptors: Mastery Tests, Probability, Intervals, Testing
Peer reviewed
De Champlain, Andre; Gessaroli, Marc E. – Applied Measurement in Education, 1998
Type I error rates and rejection rates for three dimensionality assessment procedures were studied with data sets simulated to reflect short tests and small samples. Results show that the G-squared difference test (D. Bock, R. Gibbons, and E. Muraki, 1988) suffered from a severely inflated Type I error rate under all simulated conditions. (SLD)
Descriptors: Item Response Theory, Matrices, Sample Size, Simulation
Flowers, Claudia P.; And Others – 1996
N. S. Raju, W. J. van der Linden, and P. F. Fleer (in press) have proposed an item response theory-based, parametric procedure for the detection of differential item functioning (DIF)/differential test functioning (DTF) known as differential functioning of item and test (DFIT). DFIT can be used with dichotomous, polytomous, or multidimensional…
Descriptors: Item Response Theory, Mathematical Models, Simulation, Test Bias
Bay, Luz – 1995
An index is proposed to detect cheating on multiple-choice examinations, and its use is evaluated through simulations. The proposed index is based on the compound binomial distribution. In total, 360 simulated data sets reflecting 12 different cheating (copying) situations were obtained and used for the study of the sensitivity of the index in…
Descriptors: Cheating, Class Size, Identification, Multiple Choice Tests
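For orientation on the null model behind such copying indices, the sketch below computes a compound (generalized) binomial tail probability: given assumed per-item probabilities that two independently working examinees give matching answers, it convolves the items one at a time to obtain the probability of at least m matches. This illustrates the distributional idea only, not the index proposed in the report.

```python
def p_at_least(match_probs, m):
    """P(at least m matching answers) when item i matches with probability match_probs[i]."""
    dist = [1.0]                          # P(0 matches) before any item is considered
    for p in match_probs:
        new = [0.0] * (len(dist) + 1)
        for k, q in enumerate(dist):
            new[k] += q * (1 - p)         # no match on this item
            new[k + 1] += q * p           # match on this item
        dist = new
    return sum(dist[m:])

# e.g., 40 items, each matching by chance with probability 0.3
print(p_at_least([0.3] * 40, 20))
```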
Peer reviewed
Sunathong, Surintorn; Schumacker, Randall E.; Beyerlein, Michael M. – Journal of Applied Measurement, 2000
Studied five factors that can affect the equating of scores from two tests onto a common score scale through the simulation and equating of 4,860 item data sets. Findings indicate three statistically significant two-way interactions for common item length and test length, item difficulty standard deviation and item distribution type, and item…
Descriptors: Difficulty Level, Equated Scores, Interaction, Item Response Theory
Peer reviewed
Fitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001
Examined the effects of test length and sample size on the alternate forms reliability and equating of simulated mathematics tests composed of constructed-response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least eight 6-point…
Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability
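As context for the scaling model named above, the sketch below evaluates category probabilities under a generalized (two-parameter) partial credit model. The parameter names and the example values are assumptions for illustration, not quantities taken from the article.

```python
import math

def gpcm_probs(theta, a, thresholds):
    """Probabilities of scores 0..m for one item with slope a and step thresholds b_1..b_m."""
    numerators = [1.0]                    # the score-0 term is exp(0) = 1
    s = 0.0
    for b in thresholds:
        s += a * (theta - b)              # cumulative sum of a * (theta - b_v)
        numerators.append(math.exp(s))
    total = sum(numerators)
    return [n / total for n in numerators]

# illustrative 4-category (0-3 point) item at ability theta = 0.5
print(gpcm_probs(theta=0.5, a=1.2, thresholds=[-1.0, 0.0, 1.5]))
```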
Peer reviewed
PDF on ERIC Download full text
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Peer reviewed
Whitmore, Marjorie L.; Schumacker, Randall E. – Educational and Psychological Measurement, 1999
Compared differential item functioning detection rates for logistic regression and analysis of variance for dichotomously scored items using simulated data and varying test length, sample size, discrimination rate, and underlying ability. Explains why the logistic regression method is recommended for most applications. (SLD)
Descriptors: Ability, Analysis of Variance, Comparative Analysis, Item Bias
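The logistic regression DIF procedure compared in this study is usually run as a nested-model likelihood-ratio test with the total score as the matching variable. The sketch below outlines that general approach under stated assumptions (column names, a 2-df joint test for uniform and nonuniform DIF, and toy data); it is not the authors' exact specification.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf
from scipy.stats import chi2

def dif_logistic(df):
    """Likelihood-ratio DIF test for one item; df has 0/1 'item', 'total', and 'group' columns."""
    reduced = smf.logit("item ~ total", data=df).fit(disp=0)
    full = smf.logit("item ~ total + group + total:group", data=df).fit(disp=0)
    g2 = 2 * (full.llf - reduced.llf)     # 2-df test: uniform + nonuniform DIF together
    return g2, chi2.sf(g2, df=2)

# toy data with no DIF built in, so the p-value should usually be nonsignificant
rng = np.random.default_rng(0)
theta = rng.normal(size=1000)
group = rng.integers(0, 2, size=1000)
item = rng.binomial(1, 1 / (1 + np.exp(-theta)))
total = item + rng.binomial(20, 1 / (1 + np.exp(-theta)))   # crude matching score
print(dif_logistic(pd.DataFrame({"item": item, "total": total, "group": group})))
```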
Peer reviewed
Direct link
Hendrawan, Irene; Glas, Cees A. W.; Meijer, Rob R. – Applied Psychological Measurement, 2005
The effect of person misfit to an item response theory model on a mastery/nonmastery decision was investigated. Furthermore, it was investigated whether the classification precision can be improved by identifying misfitting respondents using person-fit statistics. A simulation study was conducted to investigate the probability of a correct…
Descriptors: Probability, Statistics, Test Length, Simulation
De Champlain, Andre – 1996
The usefulness of a goodness-of-fit index proposed by R. P. McDonald (1989) was investigated with regard to assessing the dimensionality of item response matrices. The m_k index, which is based on an estimate of the noncentrality parameter of the noncentral chi-square distribution, possesses several advantages over traditional tests of…
Descriptors: Chi Square, Cutting Scores, Goodness of Fit, Item Response Theory
De Champlain, Andre F. – 1999
The purpose of this study was to examine empirical Type I error rates and rejection rates for three dimensionality assessment procedures with data sets simulated to reflect short tests and small samples. The TESTFACT G-squared difference test suffered from an inflated Type I error rate with unidimensional data sets, while the approximate chi…
Descriptors: Admission (School), College Entrance Examinations, Item Response Theory, Law Schools
Peer reviewed
Cliff, Norman; And Others – Applied Psychological Measurement, 1979
Monte Carlo research with TAILOR, a program that uses implied orders as the basis for tailored testing, is reported. For each simulated examinee, TAILOR typically required about half of the available items to estimate the responses on the remainder. (Author/CTM)
Descriptors: Adaptive Testing, Computer Programs, Item Sampling, Nonparametric Statistics
Peer reviewed
Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics
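For readers unfamiliar with U3, one common formulation of the statistic is sketched below; the article's exact definition and standardization may differ, and the example data are invented. Items are assumed to be ordered from easiest to hardest by proportion-correct, and larger values indicate more aberrant (Guttman-inconsistent) response patterns.

```python
import math

def u3(x, pi):
    """x: 0/1 responses; pi: item proportions-correct, both ordered by decreasing pi."""
    logits = [math.log(p / (1 - p)) for p in pi]
    r = sum(x)
    if r == 0 or r == len(x):
        return 0.0                        # all-wrong or all-right patterns carry no misfit information
    w = sum(xi * li for xi, li in zip(x, logits))
    w_max = sum(logits[:r])               # Guttman pattern: the r easiest items answered correctly
    w_min = sum(logits[-r:])              # reversed pattern: the r hardest items answered correctly
    return (w_max - w) / (w_max - w_min)

# a score of 3 earned on only the three hardest of six items is maximally aberrant (U3 = 1)
print(u3([0, 0, 0, 1, 1, 1], [0.9, 0.8, 0.7, 0.6, 0.5, 0.4]))
```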
Peer reviewed
Direct link
Chuah, Siang Chee; Drasgow, Fritz; Luecht, Richard – Applied Measurement in Education, 2006
Adaptive tests offer the advantages of reduced test length and increased accuracy in ability estimation. However, adaptive tests require large pools of precalibrated items. This study looks at the development of an item pool for one type of adaptive administration: the computer-adaptive sequential test. An important issue is the sample size required…
Descriptors: Test Length, Sample Size, Adaptive Testing, Item Response Theory