Showing 121 to 135 of 139 results
De Ayala, R. J. – 1993
Previous work on the effects of dimensionality on parameter estimation was extended from dichotomous models to the polytomous graded response (GR) model. A multidimensional GR model was developed to generate data in one, two, and three dimensions, with the two- and three-dimensional conditions varying in their interdimensional associations. Test…
Descriptors: Computer Simulation, Correlation, Difficulty Level, Estimation (Mathematics)
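Generating data under the graded response model mentioned above can be illustrated with a small simulation. The sketch below is for the simpler unidimensional case (the study itself used a multidimensional generalization), and all parameter values are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
n, K = 1000, 5                        # examinees, score categories
theta = rng.standard_normal(n)        # abilities
a = 1.3                               # illustrative discrimination
b = np.array([-1.5, -0.5, 0.5, 1.5])  # ordered thresholds (K - 1 of them)

# Cumulative curves P(X >= k | theta) for k = 1..K-1
p_star = 1 / (1 + np.exp(-a * (theta[:, None] - b[None, :])))

# Category probabilities are adjacent differences of the cumulative curves
cum = np.hstack([np.ones((n, 1)), p_star, np.zeros((n, 1))])
probs = cum[:, :-1] - cum[:, 1:]      # shape (n, K); each row sums to 1

# Draw one graded response per examinee
x = np.array([rng.choice(K, p=row) for row in probs])
```

Because the thresholds are ordered, the cumulative curves decrease across categories, so every category probability is nonnegative by construction.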
Hwang, Chi-en; Cleary, T. Anne – 1986
The results obtained from two basic types of test pre-equating were compared: item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…
Descriptors: Computer Simulation, Equated Scores, Latent Trait Theory, Mathematical Models
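Simulated response data of the kind this entry describes (a three-parameter logistic model with a constant guessing parameter) can be generated in a few lines. The sample and test sizes below match those quoted in the abstract, but the parameter ranges are illustrative assumptions, not those of the study:

```python
import numpy as np

rng = np.random.default_rng(0)
n_examinees, n_items = 3000, 72          # sizes quoted in the abstract
theta = rng.standard_normal(n_examinees)  # abilities
a = rng.uniform(0.5, 2.0, n_items)        # discriminations (illustrative)
b = rng.uniform(-2.0, 2.0, n_items)       # difficulties (illustrative)
c = 0.2                                   # constant guessing parameter

# 3PL: P(correct) = c + (1 - c) * logistic(1.7 * a * (theta - b))
z = 1.7 * a * (theta[:, None] - b)        # shape (n_examinees, n_items)
p = c + (1 - c) / (1 + np.exp(-z))
responses = (rng.random(p.shape) < p).astype(int)
```

With a constant guessing parameter, every response probability is bounded below by c, which is what distinguishes this design from one where c varies by item.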
Maxwell, Scott E. – 1979
Arguments have recently been put forth that standard textbook procedures for determining the sample size necessary to achieve a certain level of power in a completely randomized design are incorrect when the dependent variable is fallible because they ignore measurement error. In fact, however, there are several correct procedures, one of which is…
Descriptors: Hypothesis Testing, Mathematical Formulas, Power (Statistics), Predictor Variables
Reckase, Mark D. – 1979
Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…
Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models
Peer reviewed
Benson, Jeri; Bandalos, Deborah L. – Multivariate Behavioral Research, 1992
Factor structure of the Reactions to Tests (RTT) scale measuring test anxiety was studied by testing a series of confirmatory factor models including a second-order structure with 636 college students. Results support a shorter 20-item RTT but also raise questions about the cross-validation of covariance models. (SLD)
Descriptors: College Students, Factor Analysis, Factor Structure, Higher Education
Peer reviewed
de la Torre, Jimmy; Stark, Stephen; Chernyshenko, Oleksandr S. – Applied Psychological Measurement, 2006
The authors present a Markov Chain Monte Carlo (MCMC) parameter estimation procedure for the generalized graded unfolding model (GGUM) and compare it to the marginal maximum likelihood (MML) approach implemented in the GGUM2000 computer program, using simulated and real personality data. In the simulation study, test length, number of response…
Descriptors: Computation, Monte Carlo Methods, Markov Processes, Item Response Theory
De Champlain, Andre; Gessaroli, Marc E. – 1996
The use of indices and statistics based on nonlinear factor analysis (NLFA) has become increasingly popular as a means of assessing the dimensionality of an item response matrix. Although the indices and statistics currently available to the practitioner have been shown to be useful and accurate in many testing situations, few studies have…
Descriptors: Adaptive Testing, Chi Square, Computer Assisted Testing, Factor Analysis
Ankenmann, Robert D.; Stone, Clement A. – 1992
Effects of test length, sample size, and assumed ability distribution were investigated in a multiple replication Monte Carlo study under the 1-parameter (1P) and 2-parameter (2P) logistic graded model with five score levels. Accuracy and variability of item parameter and ability estimates were examined. Monte Carlo methods were used to evaluate…
Descriptors: Computer Simulation, Estimation (Mathematics), Item Bias, Mathematical Models
Schumacker, Randall E.; And Others – 1994
Rasch between and total weighted and unweighted fit statistics were compared using varying test lengths and sample sizes. Two test lengths (20 and 50 items) and three sample sizes (150, 500, and 1,000) were crossed. Each of the six combinations was replicated 100 times. In addition, power comparisons were made. Results indicated that there were no…
Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Power (Statistics)
Saunders, Joseph C.; Huynh, Huynh – 1980
In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Peer reviewed
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need to report IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Cohen, Allan S.; Kim, Seock-Ho – 1993
Equating tests from different calibrations under item response theory (IRT) requires calculation of the slope and intercept of the appropriate linear transformation. Two methods have been proposed recently for equating graded response items under IRT, a test characteristic curve method and a minimum chi-square method. These two methods are…
Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Equated Scores
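Whichever method estimates them, the slope and intercept of the linear transformation are applied the same way to place one calibration on the scale of another. A minimal sketch, in which the slope A, intercept B, and item parameters are all hypothetical values rather than output of the methods compared in the paper:

```python
# A (slope) and B (intercept) would come from a method such as the
# test characteristic curve or minimum chi-square approach compared
# in the paper; the values below are purely illustrative.
A, B = 1.1, -0.25
a_old = [1.2, 0.8, 1.5]    # hypothetical discriminations
b_old = [-0.5, 0.0, 1.0]   # hypothetical difficulties

# Under the rescaling theta* = A * theta + B:
a_new = [a / A for a in a_old]       # discriminations divide by the slope
b_new = [A * b + B for b in b_old]   # difficulties rescale and shift
```

For graded response items, the same transformation applies to each category threshold in place of the single difficulty shown here.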
Nandakumar, Ratna; Yu, Feng – 1994
DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…
Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics
Mills, Craig N.; Simon, Robert – 1981
When criterion-referenced tests are used to assign examinees to states reflecting their performance level on a test, the better known methods for determining test length, which consider relationships among domain scores and errors of measurement, have their limitations. The purpose of this paper is to present a computer system named TESTLEN, which…
Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores, Error of Measurement
Kim, Seock-Ho; And Others – 1992
Hierarchical Bayes procedures were compared for estimating item and ability parameters in item response theory. Simulated data sets from the two-parameter logistic model were analyzed using three different hierarchical Bayes procedures: (1) the joint Bayesian with known hyperparameters (JB1); (2) the joint Bayesian with information hyperpriors…
Descriptors: Ability, Bayesian Statistics, Comparative Analysis, Equations (Mathematics)