Showing 346 to 360 of 636 results
Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 1983
A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
Descriptors: Generalizability Theory, Test Construction, Test Items, Test Length
Peer reviewed
Allison, Paul A. – Psychometrika, 1976
A direct proof is given for the generalized Spearman-Brown formula for any real multiple of test length. (Author)
Descriptors: Correlation, Error of Measurement, Raw Scores, Test Length
PDF pending restoration
De Champlain, Andre F.; Gessaroli, Marc E.; Tang, K. Linda; De Champlain, Judy E. – 1998
The empirical Type I error rates of Poly-DIMTEST (H. Li and W. Stout, 1995) and the LISREL8 chi square fit statistic (K. Joreskog and D. Sorbom, 1993) were compared with polytomous unidimensional data sets simulated to vary as a function of test length and sample size. The rejection rates for both statistics were also studied with two-dimensional…
Descriptors: Chi Square, Goodness of Fit, Item Response Theory, Sample Size
Peer reviewed
Silverstein, A. B. – Perceptual and Motor Skills, 1983
Formulas for estimating the validity of random short forms were applied to the standardization data for the Wechsler Adult Intelligence Scale-Revised, the Minnesota Multiphasic Personality Inventory, and the Marlowe-Crowne Social Desirability Scale. These formulas demonstrated how much "better than random" the best short forms of these…
Descriptors: Comparative Analysis, Intelligence Tests, Measures (Individuals), Test Format
Peer reviewed
Stern, Paul C.; Guagnano, Gregory A.; Dietz, Thomas – Educational and Psychological Measurement, 1998
A brief version of the instrument developed by S. Schwartz (1992, 1994) to measure the structure and content of human values was developed. Studies with 199 adults and 420 adults support the reliability of scores produced by the brief inventory's four three-item scales. Uses of the brief form are discussed. (SLD)
Descriptors: Adults, Reliability, Scores, Test Construction
Peer reviewed
Wiberg, Marie – International Journal of Testing, 2006
A simulation study of a sequential computerized mastery test is carried out with items modeled with the 3 parameter logistic item response theory model. The examinees' responses are either identically distributed, not identically distributed, or not identically distributed together with estimation errors in the item characteristics. The…
Descriptors: Test Length, Computer Simulation, Mastery Tests, Item Response Theory
Peer reviewed
Modjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1978
The General Education Performance Index (GEPI) is a comparatively short test covering the same content as the General Educational Development Test (GED), which takes ten hours to administer. Correlations of the subtests of the GEPI with the GED ranged from .28 to .57. (JKS)
Descriptors: Correlation, Equivalency Tests, Military Personnel, Statistical Data
Kennedy, Robert L.; McCallister, Corliss J. – 2000
The purpose of this study was to investigate the relationship between the scores students earned on their statistics final examinations and the number of minutes students required to complete the exams. In a previous study, K. Bridges (1985) extended the range of interest in this relationship from a single study to a course-based series, examining…
Descriptors: College Students, Higher Education, Scores, Statistics
Peer reviewed
Eisenstein, Norman; Engelhart, Charles I. – Psychological Assessment, 1997
The Kaufman Brief Intelligence Test (K-BIT) (A. S. Kaufman and N. L. Kaufman, 1990) was compared with short forms of the Wechsler Adult Intelligence Scale--Revised (WAIS-R) using results from 64 referrals to a neuropsychology service. Advantages of each test are noted and their use discussed. (SLD)
Descriptors: Adults, Comparative Analysis, Intelligence Tests, Neuropsychology
Peer reviewed
Sunathong, Surintorn; Schumacker, Randall E.; Beyerlein, Michael M. – Journal of Applied Measurement, 2000
Studied five factors that can affect the equating of scores from two tests onto a common score scale through the simulation and equating of 4,860 item data sets. Findings indicate three statistically significant two-way interactions for common item length and test length, item difficulty standard deviation and item distribution type, and item…
Descriptors: Difficulty Level, Equated Scores, Interaction, Item Response Theory
Peer reviewed
Lewis, Charles; Sheehan, Kathleen – Machine-Mediated Learning, 1988
Introduces a theoretical framework for mastery testing, using Item Response Theory and Bayesian Decision Theory. The idea of sequential testing is developed, with the goal of providing longer or shorter tests as needed, and a computerized application to a hypothetical professional knowledge examination is discussed. (Author/LRW)
Descriptors: Computer Assisted Testing, Licensing Examinations (Professions), Mastery Tests, Psychometrics
Peer reviewed
Colliver, Jerry A.; And Others – Academic Medicine, 1992
A study investigated optimal length of screening tests used to sort out medical students needing to take a full-length performance-based standardized-patient test from those not needing it. Receiver operating characteristic analysis determined a good length is one-third the full test, with cutoff just above the mean case pass level. (Author/MSE)
Descriptors: Higher Education, Medical Education, Patients, Professional Education
Peer reviewed
Thompson, Anthony; Browne, Janet; Schmidt, Fred; Boer, Marian – Assessment, 1997
The validity of a four-subtest short form of the third edition of the Wechsler Intelligence Scale for Children (WISC-III) and the Kaufman Brief Intelligence Test (K-BIT) was evaluated with 42 adolescent offenders. Findings support the clinical use of the short form as a good estimate of WISC-III full-scale IQ. (SLD)
Descriptors: Adolescents, Criminals, Delinquency, Intelligence Quotient
Brennan, Robert L. – 1990
In 1955, R. Levine introduced two linear equating procedures for the common-item non-equivalent populations design. His procedures make the same assumptions about true scores; they differ in terms of the nature of the equating function used. In this paper, two parameterizations of a classical congeneric model are introduced to model the variables…
Descriptors: Equated Scores, Equations (Mathematics), Mathematical Models, Research Design
Wilcox, Rand R. – 1980
Wilcox (1977) examines two methods of estimating the probability of a false-positive or false-negative decision with a mastery test. Both procedures make assumptions about the form of the true score distribution which might not give good results in all situations. In this paper, upper and lower bounds on the two possible error types are described…
Descriptors: Cutting Scores, Mastery Tests, Mathematical Models, Student Placement