Descriptor
| Estimation (Mathematics) | 22 |
| Mathematical Models | 22 |
| Test Reliability | 22 |
| Equations (Mathematics) | 11 |
| Scores | 8 |
| Testing Problems | 5 |
| True Scores | 5 |
| Comparative Analysis | 4 |
| Computer Simulation | 4 |
| Equated Scores | 4 |
| Bayesian Statistics | 3 |
| More ▼ | |
Source
| Educational and Psychological… | 5 |
| Applied Psychological… | 2 |
| Psychometrika | 2 |
| Applied Measurement in… | 1 |
| Educational Measurement:… | 1 |
| Journal of Educational… | 1 |
| Journal of Educational… | 1 |
Author
| Algina, James | 1 |
| Braun, Henry I. | 1 |
| Brown, R. L. | 1 |
| Cahan, Sorel | 1 |
| Cobern, William W. | 1 |
| Cook, Linda L. | 1 |
| Eiting, Mindert H. | 1 |
| Gross, Susan K. | 1 |
| Gustafsson, Jan-Eric | 1 |
| Hanson, Bradley A. | 1 |
| Harvill, Leo M. | 1 |
| More ▼ | |
Publication Type
| Reports - Evaluative | 15 |
| Journal Articles | 13 |
| Speeches/Meeting Papers | 7 |
| Reports - Research | 6 |
| Computer Programs | 1 |
| Reports - Descriptive | 1 |
Education Level
Audience
| Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
| SAT (College Admission Test) | 3 |
| Advanced Placement… | 1 |
| National Assessment of… | 1 |
What Works Clearinghouse Rating
Peer reviewedHanson, Bradley A. – Journal of Educational Statistics, 1991
The formula developed by R. Levine (1955) for equating unequally reliable tests is described. The formula can be interpreted as a method of moments estimate of an equating function that results in first order equity of the equated test score under a classical congeneric model. (TJH)
Descriptors: Equated Scores, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedZimmerman, Donald W.; And Others – Educational and Psychological Measurement, 1993
Coefficient alpha was examined through computer simulation as an estimate of test reliability under violation of two assumptions. Coefficient alpha underestimated reliability under violation of the assumption of essential tau-equivalence of subtest scores and overestimated it under violation of the assumption of uncorrelated subtest error scores.…
Descriptors: Computer Simulation, Estimation (Mathematics), Mathematical Models, Robustness (Statistics)
Peer reviewedReuterberg, Sven-Eric; Gustafsson, Jan-Eric – Educational and Psychological Measurement, 1992
The use of confirmatory factor analysis by the LISREL program is demonstrated as an assumption-testing method when computing reliability coefficients under different model assumptions. Results indicate that reliability estimates are robust against departure from the assumption of parallelism of test items. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Mathematical Models, Robustness (Statistics)
Peer reviewedRae, Gordon – Educational and Psychological Measurement, 1991
A brief overview is provided of the Conger-Lipshitz approach to estimating the reliability of a profile or test battery. A computational example from a recent study shows how canonical reliability can be obtained through existing statistical software. (SLD)
Descriptors: Computer Assisted Testing, Computer Software, Correlation, Equations (Mathematics)
Braun, Henry I. – 1986
This report describes a statistically designed experiment that was carried out in an operational setting to determine the contributions of different sources of variation to the unreliability of scoring. The experiment made novel use of partially balanced incomplete block designs that facilitated the unbiased estimation of certain main effects…
Descriptors: Essay Tests, Estimation (Mathematics), Mathematical Models, Research Design
Linacre, John M. – 1993
Generalizability theory (G-theory) and many-facet Rasch measurement (Rasch) manage the variability inherent when raters rate examinees on test items. The purpose of G-theory is to estimate test reliability in a raw score metric. Unadjusted examinee raw scores are reported as measures. A variance component is estimated for the examinee…
Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Evaluators
PDF pending restorationCobern, William W. – 1986
This computer program, written in BASIC, performs three different calculations of test reliability: (1) the Kuder-Richardson method; (2); the "common split-half" method; and (3) the Rulon-Guttman split-half method. The program reads sequential access data files for microcomputers that have been set up by statistical packages such as…
Descriptors: Computer Software, Difficulty Level, Educational Research, Equations (Mathematics)
Livingston, Samuel A. – 1984
Much previously published material for estimating the reliability of classification has been based on the assumption that a test consists of a known number of equally weighted items. The test score is the number of those items answered correctly. These methods cannot be used with classifications based on weighted composite scores, especially if…
Descriptors: Equated Scores, Essay Tests, Estimation (Mathematics), Mathematical Models
Peer reviewedCahan, Sorel – Educational and Psychological Measurement, 1989
Statistical significance and "abnormality" have been used as criteria for the evaluation of intra-individual subtest score differences. Shortcomings of these criteria are identified, and improved estimates of the true score differences are suggested. The applicability of the abnormality criterion to these improved estimates is reviewed.…
Descriptors: Estimation (Mathematics), Evaluation Methods, Individual Differences, Mathematical Models
Peer reviewedLin, Miao-Hsiang; Hsiung, Chao A. – Psychometrika, 1992
Four bootstrap methods are identified for constructing confidence intervals for the binomial-error model. The extent to which similar results are obtained and the theoretical foundation of each method and its relevance and ranges of modeling the true score uncertainty are discussed. (SLD)
Descriptors: Bayesian Statistics, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedRozeboom, William W. – Applied Psychological Measurement, 1989
Formulas are provided for estimating the reliability of a linear composite of non-equivalent subtests given the reliabilities of component subtests. The reliability of the composite is compared to that of its components. An empirical example uses data from 170 children aged 4 through 8 years performing 34 Piagetian tasks. (SLD)
Descriptors: Elementary School Students, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedEiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewedHolland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990
Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…
Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias
Peer reviewedBrown, R. L. – Educational and Psychological Measurement, 1989
Three correlation matrices (PEARSON, POLYCHORIC, and TOBIT) were used to obtain reliability estimates on ordered polytomous variable models. A Monte Carlo study with different levels of variable asymmetry and 400 sample correlation matrices demonstrated that the PEARSON matrix did not perform as well as did the other 2 matrices. (SLD)
Descriptors: Analysis of Covariance, Comparative Analysis, Computer Simulation, Correlation
Peer reviewedHarvill, Leo M. – Educational Measurement: Issues and Practice, 1991
This paper discusses standard error of measurement (SEM), the amount of variation or spread in the measurement errors for a test, and gives information needed to interpret test scores using SEMs. SEMs at various score levels should be used in calculating score bands rather than a single SEM value. (SLD)
Descriptors: Definitions, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Previous Page | Next Page ยป
Pages: 1 | 2

