Descriptor
| Statistical Distributions | 6 |
| Test Construction | 3 |
| Equated Scores | 2 |
| Mathematical Models | 2 |
| Reliability | 2 |
| Scaling | 2 |
| Scoring | 2 |
| State Programs | 2 |
| Test Items | 2 |
| Test Reliability | 2 |
| Test Results | 2 |
| More ▼ | |
Source
| Applied Measurement in… | 6 |
Author
| Bandalos, Deborah L. | 1 |
| Cummings, Cynthia B. | 1 |
| Edwards, Don | 1 |
| Enders, Craig K. | 1 |
| Feldt, Leonard S. | 1 |
| Hanson, Bradley A. | 1 |
| Kiplinger, Vonda L. | 1 |
| Linn, Robert L. | 1 |
| Schiel, Jeffrey L. | 1 |
| Shaw, Dale G. | 1 |
Publication Type
| Journal Articles | 6 |
| Reports - Evaluative | 6 |
| Information Analyses | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
| National Assessment of… | 1 |
| SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Peer reviewedHanson, Bradley A. – Applied Measurement in Education, 1996
Determining whether score distributions differ on two or more test forms administered to samples of examinees from a single population is explored using three statistical tests using loglinear models. Examples are presented of applying tests of distribution differences to decide if equating is needed for alternative forms of a test. (SLD)
Descriptors: Equated Scores, Scoring, Statistical Distributions, Test Format
Peer reviewedEnders, Craig K.; Bandalos, Deborah L. – Applied Measurement in Education, 1999
Examined the degree to which coefficient alpha is affected by including items with different distribution shapes within a unidimensional scale. Computer simulation results indicate that reliability does not increase dramatically as a result of using differentially shaped items within a scale. Discusses implications for test construction. (SLD)
Descriptors: Computer Simulation, Reliability, Scaling, Statistical Distributions
Peer reviewedEdwards, Don; Cummings, Cynthia B. – Applied Measurement in Education, 1990
An evolved form of the Edwards and Beckworth (1989) model for probability selection for Scholastic Achievement Test takers using truncated normal distributions is presented. It is shown that the arguments of Holland and Wainer are not sufficient to reject this model. (SLD)
Descriptors: College Entrance Examinations, Models, Participation, Probability
Peer reviewedFeldt, Leonard S. – Applied Measurement in Education, 1993
The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)
Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models
Peer reviewedLinn, Robert L.; Kiplinger, Vonda L. – Applied Measurement in Education, 1995
The adequacy of linking statewide standardized test results to the National Assessment of Educational Progress by using equipercentile equating procedures was investigated using statewide mathematics data from four states. Results suggest that the linkings are not sufficiently trustworthy to make comparisons based on the tails of the distribution.…
Descriptors: Comparative Analysis, Educational Assessment, Equated Scores, Mathematics Tests
Peer reviewedSchiel, Jeffrey L.; Shaw, Dale G. – Applied Measurement in Education, 1992
Changes in information retention resulting from changes in reliability and number of intervals in scale construction were studied to provide quantitative information to help in decisions about choosing intervals. Information retention reached a maximum when the number of intervals was about 8 or more and reliability was near 1.0. (SLD)
Descriptors: Decision Making, Knowledge Level, Mathematical Models, Monte Carlo Methods


