Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 1 |
Descriptor
| Computer Simulation | 19 |
| Equated Scores | 19 |
| Mathematical Models | 10 |
| Error of Measurement | 9 |
| Test Items | 8 |
| Comparative Analysis | 7 |
| Item Response Theory | 7 |
| Sample Size | 7 |
| Estimation (Mathematics) | 4 |
| Latent Trait Theory | 4 |
| Statistical Studies | 4 |
| More ▼ | |
Author
| Zeng, Lingjia | 4 |
| Baker, Frank B. | 1 |
| Cleary, T. Anne | 1 |
| Cohen, Allan S. | 1 |
| Du Bose, Pansy | 1 |
| Eignor, Daniel R. | 1 |
| Fitzpatrick, Steven J. | 1 |
| Gilmer, Jerry S. | 1 |
| Hedges, Larry V. | 1 |
| Hirsch, Thomas M. | 1 |
| Hu, Huiqin | 1 |
| More ▼ | |
Publication Type
| Reports - Evaluative | 10 |
| Reports - Research | 9 |
| Journal Articles | 8 |
| Speeches/Meeting Papers | 6 |
Education Level
Audience
| Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
| National Assessment of… | 1 |
| SAT (College Admission Test) | 1 |
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Hu, Huiqin; Rogers, W. Todd; Vukmirovic, Zarko – Applied Psychological Measurement, 2008
Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)--based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IRT-based equating methods (i.e., concurrent…
Descriptors: Item Response Theory, Item Analysis, Computer Simulation, Equated Scores
Peer reviewedZeng, Lingjia; And Others – Applied Psychological Measurement, 1994
A general delta method is described for computing the standard error (SE) of a chain of linear equations. The general delta method derives the SEs directly from the moments of the score distributions obtained in the equating chain. Computer simulations demonstrate the method. (SLD)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Statistical Distributions
Peer reviewedZeng, Lingjia – Applied Psychological Measurement, 1995
The effects of different degrees of smoothing on results of equipercentile equating in random groups design using a postsmoothing method based on cubic splines were investigated, and a computer-based procedure was introduced for selecting a desirable degree of smoothing. Results suggest that no particular degree of smoothing was always optimal.…
Descriptors: Computer Simulation, Computer Software, Equated Scores, Research Methodology
Hedges, Larry V.; Vevea, Jack L. – 2003
A computer simulation study was conducted to investigate the amount of uncertainty added to National Assessment of Educational Progress estimates by equating error under three different equating methods and while varying a number of factors that might affect accuracy of equating. Data from past NAEP administrations were used to guide the…
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Item Response Theory
Peer reviewedBaker, Frank B. – Applied Psychological Measurement, 1990
The equating of results from the PC-BILOG computer program to an underlying metric was studied through simulation when a two-parameter item response theory model was used. Results are discussed in terms of the identification problem and implications for test equating. (SLD)
Descriptors: Bayesian Statistics, Computer Simulation, Equated Scores, Item Response Theory
Peer reviewedZeng, Lingjia – Applied Psychological Measurement, 1993
A numerical approach for computing standard errors (SEs) of a linear equating is described in which first partial derivatives of equating functions needed to compute SEs are derived numerically. Numerical and analytical approaches are compared using the Tucker equating method. SEs derived numerically are found indistinguishable from SEs derived…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Equations (Mathematics)
Peer reviewedLiou, Michelle – Applied Psychological Measurement, 1988
In applying I. I. Bejar's method for detecting the dimensionality of achievement tests, researchers should be cautious in interpreting the slope of the principal axis. Other information from the data is needed in conjunction with Bejar's method of addressing item dimensionality. (SLD)
Descriptors: Achievement Tests, Computer Simulation, Difficulty Level, Equated Scores
Morrison, Carol A.; Fitzpatrick, Steven J. – 1992
An attempt was made to determine which item response theory (IRT) equating method results in the least amount of equating error or "scale drift" when equating scores across one or more test forms. An internal anchor test design was employed with five different test forms, each consisting of 30 items, 10 in common with the base test and 5…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Error of Measurement
Hwang, Chi-en; Cleary, T. Anne – 1986
The results obtained from two basic types of pre-equatings of tests were compared: the item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…
Descriptors: Computer Simulation, Equated Scores, Latent Trait Theory, Mathematical Models
Peer reviewedJarjoura, David; Kolen, Michael J. – Journal of Educational Statistics, 1985
An equating design in which two groups of examinees from slightly different populations are administered a different test form with a subset of common items is widely used. This paper presents standard errors and a simulation that verifies the equation for large samples for an equipercentile equating procedure for this design. (Author/BS)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Estimation (Mathematics)
Tang, K. Linda; And Others – 1993
This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)
Skaggs, Gary; Lissitz, Robert W. – 1985
This study examined how four commonly used test equating procedures (linear, equipercentile, Rasch Model, and three-parameter) would respond to situations in which the properties or the two tests being equated were different. Data for two tests plus an external anchor test were generated from a three parameter model in which mean test differences…
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Goodness of Fit
Zeng, Lingjia – 1991
Large sample standard errors of linear equating for the single-group design are derived without making the normality assumption. Two general methods based on the delta method of M. Kendall and A. Stuart (1977) are described. One method uses the exact partial derivatives, and the other uses numerical derivatives. Simulation using the beta-binomial…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Equations (Mathematics)
Cohen, Allan S.; Kim, Seock-Ho – 1993
Equating tests from different calibrations under item response theory (IRT) requires calculation of the slope and intercept of the appropriate linear transformation. Two methods have been proposed recently for equating graded response items under IRT, a test characteristic curve method and a minimum chi-square method. These two methods are…
Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Equated Scores
Stocking, Martha L.; Eignor, Daniel R. – 1986
In item response theory (IRT), preequating depends upon item parameter estimate invariance. Three separate simulations, all using the unidimensional three-parameter logistic item response model, were conducted to study the impact of the following variables on preequating: (1) mean differences in ability; (2) multidimensionality in the data; and…
Descriptors: College Entrance Examinations, Computer Simulation, Equated Scores, Error of Measurement
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
