Publication Date
| In 2026 | 0 |
| Since 2025 | 50 |
| Since 2022 (last 5 years) | 317 |
| Since 2017 (last 10 years) | 724 |
| Since 2007 (last 20 years) | 1793 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 73 |
| Practitioners | 22 |
| Teachers | 19 |
| Policymakers | 11 |
| Administrators | 5 |
| Students | 4 |
| Community | 1 |
| Media Staff | 1 |
Location
| Turkey | 54 |
| United States | 46 |
| Australia | 28 |
| United Kingdom | 21 |
| California | 19 |
| Canada | 19 |
| China | 16 |
| Texas | 16 |
| Germany | 14 |
| Nigeria | 14 |
| Taiwan | 14 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 4 |
Forster, Fred; Ingebo, George – 1978
Six monographs on the Rasch model are summarized. The first gives a historical perspective on the application of the Rasch model in the Portland, Oregon, metropolitan area. The remaining papers summarize research on the Rasch model. The research in Monograph II lead to the conclusion that random samples are not needed to calibrate item levels in…
Descriptors: Achievement Tests, Elementary Education, Field Tests, Item Banks
CLEARY, T.A.; LINN, ROBERT L. – 1967
THE PURPOSE OF THIS RESEARCH WAS TO STUDY THE EFFECT OF ERROR OF MEASUREMENT UPON THE POWER OF STATISTICAL TESTS. ATTENTION WAS FOCUSED ON THE F-TEST OF THE SINGLE FACTOR ANALYSIS OF VARIANCE. FORMULAS WERE DERIVED TO SHOW THE RELATIONSHIP BETWEEN THE NONCENTRALITY PARAMETERS FOR ANALYSES USING TRUE SCORES AND THOSE USING OBSERVED SCORES. THE…
Descriptors: Analysis of Variance, Error of Measurement, Measurement Techniques, Psychological Testing
PDF pending restorationReckase, Mark D. – 1979
Because latent trait models require that large numbers of items be calibrated or that testing of the same large group be repeated, item parameter estimates are often obtained by administering separate tests to different groups and "linking" the results to construct an adequate item pool. Four issues were studied, based upon the analysis…
Descriptors: Achievement Tests, High Schools, Item Banks, Mathematical Models
Kolen, Michael J.; Whitney, Douglas R. – 1978
The application of latent trait theory to classroom tests necessitates the use of small sample sizes for parameter estimation. Computer generated data were used to assess the accuracy of estimation of the slope and location parameters in the two parameter logistic model with fixed abilities and varying small sample sizes. The maximum likelihood…
Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models
Neel, John H.; Stallings, William M. – 1974
An influential statistics test recommends a Levene text for homogeneity of variance. A recent note suggests that Levene's test is upwardly biased for small samples. Another report shows inflated Alpha estimates and low power. Neither study utilized more than two sample sizes. This Monte Carlo study involved sampling from a normal population for…
Descriptors: Analysis of Variance, Educational Research, Hypothesis Testing, Monte Carlo Methods
Peer reviewedBroodbooks, Wendy J.; Elmore, Patricia B. – Educational and Psychological Measurement, 1987
The effects of sample size, number of variables, and population value of the congruence coefficient on the sampling distribution of the congruence coefficient were examined. Sample data were generated on the basis of the common factor model, and principal axes factor analyses were performed. (Author/LMO)
Descriptors: Factor Analysis, Mathematical Models, Monte Carlo Methods, Predictor Variables
Peer reviewedRoss, Kenneth N. – International Journal of Educational Research, 1987
This article considers various kinds of probability and non-probability samples in both experimental and survey studies. Throughout, how a sample is chosen is stressed. Size alone is not the determining consideration in sample selection. Good samples do not occur by accident; they are the result of a careful design. (Author/JAZ)
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Experimental Groups
Peer reviewedWilcox, Rand R.; Charlin, Ventura L. – Journal of Educational Statistics, 1986
This paper investigates three methods for comparing medians rather than means in studying two independent treatment groups. The method that gave the best results is based on a normal approximation of the distribution of the sample median where the variance is estimated using results reported by Maritz and Jarrett. (Author/JAZ)
Descriptors: Comparative Analysis, Computer Simulation, Computer Software, Equations (Mathematics)
Peer reviewedShepard, Lorrie A.; And Others – Journal of Educational Measurement, 1985
The purpose of this research was to recommend an item bias procedure when the number of minority examinees is too small to use preferred three-parameter item response theory (IRT) methods. The chi-square, Angoff delta-plot, and pseudo-IRT indices were compared with both real and simulated data. (Author/DWH)
Descriptors: Estimation (Mathematics), Item Analysis, Latent Trait Theory, Minority Groups
Peer reviewedJarjoura, David; Kolen, Michael J. – Journal of Educational Statistics, 1985
An equating design in which two groups of examinees from slightly different populations are administered a different test form with a subset of common items is widely used. This paper presents standard errors and a simulation that verifies the equation for large samples for an equipercentile equating procedure for this design. (Author/BS)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Estimation (Mathematics)
Shaughnessy, J. Michael; Ciancetta, Matt; Canada, Dan – International Group for the Psychology of Mathematics Education, 2004
As part of a research project on students' understanding of variability in statistics, 272 students, (84 middle school and 188 secondary school, grades 6-12) were surveyed on a series of tasks involving repeated sampling. Students' reasoning on the tasks predominantly fell into three types: additive, proportional, or distributional, depending on…
Descriptors: Sampling, Sample Size, Secondary School Students, Statistical Analysis
Tang, K. Linda; And Others – 1993
This study compared the performance of the LOGIST and BILOG computer programs on item response theory (IRT) based scaling and equating for the Test of English as a Foreign Language (TOEFL) using real and simulated data and two calibration structures. Applications of IRT for the TOEFL program are based on the three-parameter logistic (3PL) model.…
Descriptors: Comparative Analysis, Computer Simulation, Equated Scores, Estimation (Mathematics)
Ban, Jae-Chun; Hanson, Bradley A.; Wang, Tianyou; Yi, Qing; Harris, Deborah J. – 2000
The purpose of this study was to compare and evaluate five online pretest item calibration/scaling methods in computerized adaptive testing (CAT): (1) the marginal maximum likelihood estimate with one-EM cycle (OEM); (2) the marginal maximum likelihood estimate with multiple EM cycles (MEM); (3) Stocking's Method A (M. Stocking, 1988); (4)…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)
Witta, E. Lea; Brubaker, Craig – Online Submission, 2003
When studies are conducted over a period of time, the sample size typically decreases. In a study of the effects of exercise therapy and education with recovering congestive heart failure (CHF) patients (Brubaker, Witta, & Angelopoulus, 2003), the sample size decreased from over 40 to 9 participants after an 18-month time span. Although the…
Descriptors: Heart Disorders, Exercise, Health Education, Therapy
Peer reviewedYates, James R.; Ortiz, Alba A. – NABE: The Journal for the National Association for Bilingual Education, 1983
Significant questions are raised concerning the Baker/de Kanter review, which concluded that "the case for the effectiveness of transitional bilingual education is so weak that exclusive reliance on this instructional method is clearly not justified." Sources used show methodological problems and multiple examples of deficiences in nine…
Descriptors: Bias, Bilingual Education Programs, Elementary Secondary Education, Hispanic Americans


