Peer reviewed: Reise, Steve P.; Yu, Jiayuan – Journal of Educational Measurement, 1990
Parameter recovery in the graded-response model was investigated using the MULTILOG computer program under default conditions. Results from 36 simulated data sets suggest that at least 500 examinees are needed to achieve adequate calibration under the graded model. Sample size had little influence on recovery of the true ability parameters. (SLD)
Descriptors: Computer Assisted Testing, Computer Simulation, Computer Software, Estimation (Mathematics)
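
The calibration setting in this abstract can be made concrete with a small simulation. The sketch below generates responses to a single four-category item under Samejima's graded-response model for 500 simulated examinees; the slope and threshold values are illustrative assumptions, not parameters from the study, and MULTILOG itself is not involved.

```python
# Minimal sketch: simulating graded-response model (Samejima) data of the kind
# calibrated in the study. Item parameters below are illustrative, not from the article.
import numpy as np

rng = np.random.default_rng(0)

def simulate_grm(theta, a, b):
    """Return responses in 0..K for one item with slope a and ordered thresholds b (length K)."""
    # P(X >= k | theta) for k = 1..K (cumulative boundary curves)
    p_star = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))
    # category probabilities: P(X = k) = P(X >= k) - P(X >= k+1)
    cum = np.hstack([np.ones((theta.size, 1)), p_star, np.zeros((theta.size, 1))])
    probs = cum[:, :-1] - cum[:, 1:]
    u = rng.random(theta.size)
    return (u[:, None] > np.cumsum(probs, axis=1)).sum(axis=1)

n_examinees = 500                        # the sample size the study found adequate
theta = rng.normal(0, 1, n_examinees)
a = 1.2                                  # illustrative slope
b = np.array([-1.0, 0.0, 1.0])           # illustrative ordered thresholds
responses = simulate_grm(theta, a, b)    # one 4-category item
print(np.bincount(responses, minlength=4))
```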
Peer reviewed: Reckase, Mark D.; And Others – Journal of Educational Measurement, 1988
It is demonstrated, theoretically and empirically, that item sets can be selected that meet the unidimensionality assumption of most item response theory models, even though they require more than one ability for a correct response. A method for identifying such item sets for test development purposes is presented. (SLD)
Descriptors: Computer Simulation, Item Analysis, Latent Trait Theory, Mathematical Models
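
A rough way to see the point of the abstract is to generate data in which every item requires two abilities but all items point in the same direction of the two-dimensional space. The sketch below does this with a compensatory two-dimensional 2PL model (the 45-degree item direction and other values are assumptions) and checks that the inter-item correlation matrix has a single dominant eigenvalue, the usual informal signal of essential unidimensionality.

```python
# Illustrative sketch (not the authors' code): items that each require two abilities
# but share the same direction in the 2-D space, so responses behave unidimensionally.
import numpy as np

rng = np.random.default_rng(1)
n, n_items = 2000, 20
theta = rng.normal(size=(n, 2))                      # two latent abilities
angle = np.deg2rad(45)                               # common item direction (assumed)
a = np.tile([np.cos(angle), np.sin(angle)], (n_items, 1)) * 1.3
d = rng.uniform(-1, 1, n_items)                      # illustrative intercepts

p = 1 / (1 + np.exp(-(theta @ a.T + d)))             # compensatory M2PL probabilities
x = (rng.random(p.shape) < p).astype(int)

eigvals = np.linalg.eigvalsh(np.corrcoef(x, rowvar=False))[::-1]
print(eigvals[:3])   # a dominant first eigenvalue suggests essential unidimensionality
```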
Peer reviewed: Barcikowski, Robert S. – Journal of Educational Measurement, 1972
These results indicate that in deciding on the data-gathering design to be used in seeking norm information, attention should be given to item characteristics and test length, particularly the range of biserial correlations between item response and ability. (Author)
Descriptors: Item Sampling, Mathematical Models, Measurement Techniques, Monte Carlo Methods
Peer reviewed: Spray, Judith A.; Welch, Catherine J. – Journal of Educational Measurement, 1990
The effect of large within-examinee item difficulty variability on estimates of the proportion of consistent classification of examinees into mastery categories was studied over two test administrations for 100 simulated examinees. The proportion of consistent classifications was adequately estimated using the technique proposed by M. Subkoviak…
Descriptors: Classification, Difficulty Level, Estimation (Mathematics), Item Response Theory
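
The sketch below illustrates the general single-administration logic behind Subkoviak-style consistency estimates, under a deliberately simplified assumption: each examinee's observed proportion correct stands in for the true domain score, and a binomial error model gives the chance of clearing the cut on a parallel form. The item count, cut score, and simulated scores are invented for illustration.

```python
# Hedged sketch of a Subkoviak-style single-administration estimate of the
# proportion of consistent mastery classifications (simplified: the observed
# proportion correct stands in for the true domain score).
import numpy as np
from scipy.stats import binom

def consistency_estimate(scores, n_items, cutoff):
    """scores: observed number-correct scores; cutoff: mastery cut score."""
    p_true = scores / n_items                       # crude domain-score estimate
    # probability this examinee clears the cut on an independent parallel form
    p_pass = binom.sf(cutoff - 1, n_items, p_true)  # P(X >= cutoff)
    # consistent classification on two forms: pass twice or fail twice
    p_consistent = p_pass**2 + (1 - p_pass)**2
    return p_consistent.mean()

rng = np.random.default_rng(2)
scores = rng.binomial(40, rng.beta(6, 3, size=100))   # 100 simulated examinees, 40 items
print(round(consistency_estimate(scores, n_items=40, cutoff=28), 3))
```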
Peer reviewed: Nandakumar, Ratna – Journal of Educational Measurement, 1991
W. F. Stout's statistical test of essential unidimensionality (1990), a method for assessing lack of unidimensionality in test data, was studied using Monte Carlo simulations. The procedure is a hypothesis test of whether the essential dimensionality is one or exceeds one, regardless of the traditional dimensionality…
Descriptors: Ability, Achievement Tests, Computer Simulation, Equations (Mathematics)
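
The core idea behind Stout's procedure can be illustrated, in much reduced form, through conditional covariances: if the data are essentially unidimensional, pairs of items should be nearly uncorrelated once the score on the remaining items is held fixed. The sketch below computes that quantity for simulated unidimensional data; it omits the item split and bias correction of the actual DIMTEST statistic, and all data-generating values are assumptions.

```python
# Very simplified sketch of the idea behind Stout's procedure (not the actual
# DIMTEST statistic): near-zero conditional covariances between item pairs,
# given the score on the remaining items, are consistent with essential
# unidimensionality.
import numpy as np

rng = np.random.default_rng(8)
n, n_items = 3000, 20
theta = rng.normal(0, 1, n)
b = rng.normal(0, 1, n_items)
x = (rng.random((n, n_items)) < 1 / (1 + np.exp(-(theta[:, None] - b)))).astype(int)

i, j = 0, 1                                     # studied item pair
rest = x.sum(axis=1) - x[:, i] - x[:, j]        # conditioning score from the other items
cond_covs = []
for k in np.unique(rest):
    s = rest == k
    if s.sum() > 30:                            # skip sparse strata
        cond_covs.append(np.cov(x[s, i], x[s, j])[0, 1])
print("mean conditional covariance:", round(float(np.mean(cond_covs)), 4))
```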
Peer reviewed: Gressard, Risa P.; Loyd, Brenda H. – Journal of Educational Measurement, 1991
A Monte Carlo study, which simulated 10,000 examinees' responses to four tests, investigated the effect of item stratification on parameter estimation in multiple matrix sampling of achievement data. Practical multiple matrix sampling is based on item stratification by item discrimination and a sampling plan with a moderate number of subtests. (SLD)
Descriptors: Achievement Tests, Comparative Testing, Computer Simulation, Estimation (Mathematics)
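
A minimal sketch of the sampling plan the abstract describes, under assumed item and examinee parameters: items are stratified by discrimination and dealt across subtests, each simulated examinee answers only one subtest, and the mean total score is then estimated from the subtest means. This is not the study's design or code, just an illustration of the mechanics.

```python
# Illustrative sketch (assumed setup): multiple matrix sampling in which items are
# stratified by discrimination before being spread across subtests, then the
# population mean total score is estimated from subtest results.
import numpy as np

rng = np.random.default_rng(3)
n_items, n_subtests = 40, 4
a = rng.lognormal(0, 0.3, n_items)                 # item discriminations (illustrative)
b = rng.normal(0, 1, n_items)                      # item difficulties

# stratify by discrimination, then deal items round-robin so each subtest
# gets a comparable mix of low/medium/high discriminating items
order = np.argsort(a)
subtest_of_item = np.empty(n_items, dtype=int)
subtest_of_item[order] = np.arange(n_items) % n_subtests

theta = rng.normal(0, 1, 10000)                    # simulated examinee abilities
p = 1 / (1 + np.exp(-a * (theta[:, None] - b)))    # 2PL response probabilities
x = (rng.random(p.shape) < p).astype(int)

# each examinee answers only one subtest; scale the subtest mean up to the full test
assigned = rng.integers(0, n_subtests, theta.size)
est = np.mean([
    x[assigned == s][:, subtest_of_item == s].mean() * n_items
    for s in range(n_subtests)
])
print(f"estimated mean total score: {est:.2f}, actual: {x.sum(axis=1).mean():.2f}")
```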
Peer reviewed: van den Bergh, Huub; Eiting, Mindert H. – Journal of Educational Measurement, 1989
A method of assessing rater reliability via a design of overlapping rater teams is presented. Covariances or correlations of ratings can be analyzed with LISREL models. Models in which the rater reliabilities are congeneric, tau-equivalent, or parallel can be tested. Two examples based on essay ratings are presented. (TJH)
Descriptors: Analysis of Covariance, Computer Simulation, Correlation, Elementary Secondary Education
Peer reviewed: Nandakumar, Ratna – Journal of Educational Measurement, 1993
The phenomenon of simultaneous differential item functioning (DIF) amplification and cancellation and the role of the SIBTEST approach in detecting DIF are investigated with a variety of simulated test data. The effectiveness of SIBTEST is supported, and the implications of DIF amplification and cancellation are discussed. (SLD)
Descriptors: Computer Simulation, Elementary Secondary Education, Equal Education, Equations (Mathematics)
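
DIF amplification and cancellation are easy to show with a simplified SIBTEST-style statistic: a weighted difference in studied-item score between groups, conditional on a matching-subtest score (the regression correction used by the real procedure is omitted). In the sketch below, two items with assumed DIF in opposite directions each show nonzero values alone but roughly cancel when analyzed as a bundle.

```python
# Simplified sketch of the SIBTEST idea (without the regression correction used
# in the real procedure): DIF in opposite directions on two items can cancel
# when the items are analyzed together as a bundle.
import numpy as np

def beta_hat(studied, matching, group):
    """Weighted group difference in studied-item (or bundle) score,
    conditioning on the matching-subtest score."""
    total, weight = 0.0, 0
    for k in np.unique(matching):
        ref = studied[(matching == k) & (group == 0)]
        foc = studied[(matching == k) & (group == 1)]
        if len(ref) and len(foc):
            n_k = len(ref) + len(foc)
            total += n_k * (ref.mean() - foc.mean())
            weight += n_k
    return total / weight

rng = np.random.default_rng(4)
n = 4000
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n)                             # equal ability distributions
matching = rng.binomial(30, 1 / (1 + np.exp(-theta)))   # matching subtest score

# item 1 favors the reference group, item 2 the focal group (illustrative shifts)
p1 = 1 / (1 + np.exp(-(theta + 0.4 * (group == 0))))
p2 = 1 / (1 + np.exp(-(theta + 0.4 * (group == 1))))
x1 = (rng.random(n) < p1).astype(int)
x2 = (rng.random(n) < p2).astype(int)

print("item 1 alone:", round(beta_hat(x1, matching, group), 3))
print("item 2 alone:", round(beta_hat(x2, matching, group), 3))
print("two-item bundle:", round(beta_hat(x1 + x2, matching, group), 3))   # near zero
```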
Peer reviewed: Swaminathan, Hariharan; Rogers, H. Jane – Journal of Educational Measurement, 1990
A logistic regression model for characterizing differential item functioning (DIF) between two groups is presented. A distinction is drawn between uniform and nonuniform DIF in terms of model parameters. A statistic for testing the hypotheses of no DIF is developed, and simulation studies compare it with the Mantel-Haenszel procedure. (Author/TJH)
Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)
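
The modeling strategy in this abstract translates directly into nested logistic regressions: the group term captures uniform DIF and the score-by-group interaction captures nonuniform DIF, each tested with a likelihood-ratio statistic. The sketch below uses simulated data and statsmodels; the sample size, DIF magnitude, and use of total score as the matching variable are assumptions made for illustration.

```python
# Hedged sketch of the logistic-regression DIF approach: nested models with total
# score, group, and a score-by-group interaction. The group term tests uniform DIF
# and the interaction term nonuniform DIF. Simulated values are illustrative.
import numpy as np
import statsmodels.api as sm
from scipy.stats import chi2

rng = np.random.default_rng(5)
n = 2000
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n)
score = rng.binomial(30, 1 / (1 + np.exp(-theta)))          # matching total score
# studied item with some uniform DIF against the focal group (illustrative)
y = (rng.random(n) < 1 / (1 + np.exp(-(theta - 0.5 * group)))).astype(int)

def loglik(X):
    return sm.Logit(y, sm.add_constant(X)).fit(disp=0).llf

ll_base    = loglik(np.column_stack([score]))
ll_uniform = loglik(np.column_stack([score, group]))
ll_full    = loglik(np.column_stack([score, group, score * group]))

print("uniform DIF LR test p =", chi2.sf(2 * (ll_uniform - ll_base), df=1))
print("nonuniform DIF LR test p =", chi2.sf(2 * (ll_full - ll_uniform), df=1))
```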
Peer reviewed: Zwick, Rebecca – Journal of Educational Measurement, 1987
National Assessment of Educational Progress reading data were scaled using a unidimensional item response theory model. Bock's full-information factor analysis and Rosenbaum's test of unidimensionality were applied. Conclusions about unidimensionality for balanced incomplete block spiralled data were the same as for complete data. (Author/GDC)
Descriptors: Factor Analysis, Item Analysis, Latent Trait Theory, Mathematical Models
Peer reviewed: Plake, Barbara S.; Kane, Michael T. – Journal of Educational Measurement, 1991
Several methods for determining a passing score on an examination from individual raters' estimates of minimal pass levels were compared through simulation. The methods differed in the weighting that estimates for each item received in the aggregation process. Reasons why the simplest procedure is most preferred are discussed. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Cutting Scores, Estimation (Mathematics)
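
As a hedged illustration of the simplest aggregation such studies consider (an assumption on my part, since the abstract does not spell it out), the sketch below averages each item's minimal pass levels over raters and sums the averages to obtain the test passing score. The ratings are invented.

```python
# Minimal sketch (assumed to be the "simplest" unweighted aggregation): average
# each item's minimal pass levels over raters, then sum over items.
import numpy as np

# rows = raters, columns = items; entries are minimal pass levels (0..1) per item
mpl = np.array([
    [0.60, 0.75, 0.40, 0.55, 0.80],
    [0.65, 0.70, 0.45, 0.50, 0.85],
    [0.55, 0.80, 0.35, 0.60, 0.75],
])

passing_score = mpl.mean(axis=0).sum()   # unweighted: every rater and item counts equally
print(round(passing_score, 2))           # expected score needed to pass a 5-item test
```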
Peer reviewed: Gustafsson, Jan-Eric – Journal of Educational Measurement, 1979
Computer generated data are used to show that Slinde and Linn's criticism of the usefulness of the Rasch model for equating (EJ 189 585) may have been the result of an artifact produced by the manner in which the samples were chosen in their study. (CTM)
Descriptors: Achievement Tests, Bias, College Entrance Examinations, Equated Scores
Peer reviewed: Hirsch, Thomas M. – Journal of Educational Measurement, 1989
Equatings were performed on both simulated and real data sets using a common-examinee design and two abilities for each examinee. Results indicate that effective equating, as measured by comparability of true scores, is possible with the techniques used in this study. However, the stability of the ability estimates proved unsatisfactory. (TJH)
Descriptors: Academic Ability, College Students, Comparative Analysis, Computer Assisted Testing
Peer reviewed: Zwick, Rebecca; And Others – Journal of Educational Measurement, 1993
Two extensions of the Mantel-Haenszel procedure that may be useful in assessing differential item functioning (DIF) are explored. Simulation results showed that, for both inferential procedures, the studied item should be included in the matching variable, as in the dichotomous case. (SLD)
Descriptors: Computer Simulation, Educational Assessment, Elementary Secondary Education, Equations (Mathematics)
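
The extensions studied here build on the dichotomous Mantel-Haenszel statistic, which the sketch below computes for one simulated studied item: a common odds ratio is accumulated over matching-score strata, with the studied item included in the matching score as the abstract recommends. The data-generating values and the ETS delta conversion shown at the end are illustrative.

```python
# Sketch of the dichotomous Mantel-Haenszel DIF statistic that these extensions
# generalize; the simulated data and stratification are illustrative.
import numpy as np

rng = np.random.default_rng(6)
n = 3000
group = rng.integers(0, 2, n)                       # 0 = reference, 1 = focal
theta = rng.normal(0, 1, n)
item = (rng.random(n) < 1 / (1 + np.exp(-(theta - 0.4 * group)))).astype(int)
rest = rng.binomial(29, 1 / (1 + np.exp(-theta)))
match = rest + item                                 # matching score includes the studied item

num, den = 0.0, 0.0
for k in np.unique(match):
    s = match == k
    a = np.sum(s & (group == 0) & (item == 1))      # reference right
    b = np.sum(s & (group == 0) & (item == 0))      # reference wrong
    c = np.sum(s & (group == 1) & (item == 1))      # focal right
    d = np.sum(s & (group == 1) & (item == 0))      # focal wrong
    t = a + b + c + d
    if t:
        num += a * d / t
        den += b * c / t

alpha_mh = num / den
print("MH common odds ratio:", round(alpha_mh, 2))
print("MH delta:", round(-2.35 * np.log(alpha_mh), 2))   # ETS delta scale
```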
Peer reviewed: Ackerman, Terry A. – Journal of Educational Measurement, 1992
The difference between item bias and item impact and the way they relate to item validity are discussed from a multidimensional item response theory perspective. The Mantel-Haenszel procedure and the Simultaneous Item Bias strategy are used in a Monte Carlo study to illustrate detection of item bias. (SLD)
Descriptors: Causal Models, Computer Simulation, Construct Validity, Equations (Mathematics)
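
One way to picture the bias/impact distinction is a small Monte Carlo in which the two groups have identical distributions on the intended ability but differ on a nuisance dimension. In the sketch below (all parameter values assumed), an item measuring only the intended ability shows no group difference once ability is held comparable, while an item that also loads on the nuisance dimension does.

```python
# Illustrative Monte Carlo sketch of the bias/impact distinction from a
# multidimensional perspective: groups match on the intended ability (theta)
# but differ on a nuisance dimension (eta), so only the item that also
# measures eta shows a conditional group difference.
import numpy as np

rng = np.random.default_rng(7)
n = 5000
group = rng.integers(0, 2, n)
theta = rng.normal(0, 1, n)                          # intended ability, same for both groups
eta = rng.normal(-0.5 * group, 1)                    # nuisance ability, groups differ

p_pure   = 1 / (1 + np.exp(-theta))                       # item measuring theta only
p_biased = 1 / (1 + np.exp(-(0.7 * theta + 0.7 * eta)))   # item also measuring eta

for name, p in [("pure item", p_pure), ("biased item", p_biased)]:
    x = (rng.random(n) < p).astype(int)
    # conditional on comparable theta, do the groups still differ?
    mid = np.abs(theta) < 0.25
    diff = x[mid & (group == 0)].mean() - x[mid & (group == 1)].mean()
    print(f"{name}: conditional proportion-correct difference = {diff:.3f}")
```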


