Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 20 |
| Since 2007 (last 20 years) | 44 |
Descriptor
| Statistical Analysis | 71 |
| Test Length | 71 |
| Item Response Theory | 30 |
| Test Items | 29 |
| Sample Size | 28 |
| Comparative Analysis | 16 |
| Test Reliability | 16 |
| Correlation | 15 |
| Error of Measurement | 13 |
| Scores | 13 |
| Computation | 12 |
Author
| Bulut, Okan | 2 |
| Cohen, Allan S. | 2 |
| Huggins-Manley, Anne Corinne | 2 |
| Paek, Insu | 2 |
| Svetina, Dubravka | 2 |
| Tay, Louis | 2 |
| Wang, Wen-Chung | 2 |
| Weiss, David J. | 2 |
| Yormaz, Seha | 2 |
| de Jong, John H. A. L. | 2 |
| Abad, Francisco J. | 1 |
Publication Type
| Reports - Research | 55 |
| Journal Articles | 47 |
| Reports - Evaluative | 9 |
| Speeches/Meeting Papers | 5 |
| Dissertations/Theses -… | 3 |
| Tests/Questionnaires | 2 |
| Information Analyses | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Higher Education | 5 |
| Postsecondary Education | 4 |
| Secondary Education | 3 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 3 | 1 |
| High Schools | 1 |
Audience
| Researchers | 1 |
Assessments and Surveys
| SAT (College Admission Test) | 2 |
| California Psychological… | 1 |
| Program for International… | 1 |
| Stanford Binet Intelligence… | 1 |
| Test of English as a Foreign… | 1 |
| Wechsler Adult Intelligence… | 1 |
| Wechsler Individual… | 1 |
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small α levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
Peer reviewed: Rowley, Glenn – Journal of Educational Measurement, 1978
The reliabilities of various observational measures were determined, and the influence of both the number and the length of the observation periods on reliability was examined, both separately and jointly. A single simplifying assumption leads to a variant of the Spearman-Brown formula, which may have wider application. (Author/CTM)
Descriptors: Career Development, Classroom Observation Techniques, Observation, Reliability
Peer reviewed: Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
Peer reviewed: Wilcox, Rand R. – Journal of Educational Statistics, 1979
Methods are described for obtaining upper and lower bounds to both false-positive and false-negative decisions with a mastery test. These methods make no assumptions about the form of the true score distribution. (CTM)
Descriptors: Bayesian Statistics, Cutting Scores, Mastery Tests, Mathematical Formulas
De Champlain, Andre F.; Gessaroli, Marc E. – 1997
A study was conducted to compare, with simulated unidimensional and two-dimensional sets, the Type I error probabilities and rejection rates obtained with two versions of the LISREL computer program, the earlier version PRELIS/LISREL 7 and the later version PRELIS2/LISREL 8, a version that corrects the asymptotic covariance matrix. Unidimensional…
Descriptors: Chi Square, Comparative Analysis, Goodness of Fit, Item Response Theory
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models
Peer reviewed: Kristof, Walter – Psychometrika, 1971
Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing
Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models
Harris, Dickie A.; Penell, Roger J. – 1977
This study used a series of simulations to answer questions about the efficacy of adaptive testing raised by empirical studies. The first study showed that for reasonably high entry points, parameters estimated from paper-and-pencil test protocols cross-validated remarkably well to groups actually tested at a computer terminal. This suggested that…
Descriptors: Adaptive Testing, Computer Assisted Testing, Cost Effectiveness, Difficulty Level
Wang, Wen-Chung; Chen, Cheng-Te – Educational and Psychological Measurement, 2005
This study investigates item parameter recovery, standard error estimates, and fit statistics yielded by the WINSTEPS program under the Rasch model and the rating scale model through Monte Carlo simulations. The independent variables were item response model, test length, and sample size. WINSTEPS yielded practically unbiased estimates for the…
Descriptors: Statistics, Test Length, Rating Scales, Item Response Theory
Misanchuk, Earl R. – 1978
Multiple matrix sampling of three subscales of the California Psychological Inventory was used to investigate the effects of four variables on error estimates of the mean (EEM) and variance (EEV). The four variables were examinee population size (600, 450, 300, 150, 100, and 75); number of subtests (2, 3, 4, 5, 6, and 7), hence the number of…
Descriptors: Adults, Analysis of Variance, Error of Measurement, Item Sampling
Utah State Dept. of Employment Security, Salt Lake City. Western Test Development Field Center. – 1981
Research and analysis conducted to determine the effects of reducing the administration time for one or more levels of the Basic Occupational Literacy Test (BOLT) are described. The total usable sample consisted of 2,423 subjects. Data were collected from 23 states from 1978 to 1981. Data came from a variety of sources, including schools and…
Descriptors: Adult Students, College Students, Minority Groups, Occupational Tests
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
de Jong, John H. A. L. – 1984
The Netherlands' secondary education system is highly differentiated, with four different school types for four scholastic ability levels. Final examinations must accommodate these four levels, and require a test-independent definition of the intended final ability levels as well as a sample-free evaluation of the range of ability levels at which…
Descriptors: Difficulty Level, Efficiency, Equated Scores, Foreign Countries

