Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Item Sampling | 70 |
| Statistical Analysis | 70 |
| Mathematical Models | 16 |
| Matrices | 14 |
| Sampling | 14 |
| Test Construction | 13 |
| Test Reliability | 13 |
| Item Analysis | 12 |
| Test Items | 12 |
| Error of Measurement | 10 |
| Test Interpretation | 10 |
| More ▼ | |
Source
Author
| Shoemaker, David M. | 14 |
| Pandey, Tej N. | 4 |
| Forsyth, Robert A. | 3 |
| Harris, Chester W. | 2 |
| Scheetz, James P. | 2 |
| Sirotnik, Ken | 2 |
| Aparisi, D. | 1 |
| Austin, Dean A. | 1 |
| Bashkov, Bozhidar M. | 1 |
| Beaton, Albert E. | 1 |
| Bechger, Timo M. | 1 |
| More ▼ | |
Publication Type
Education Level
| Secondary Education | 2 |
| Grade 10 | 1 |
| Grade 9 | 1 |
| High Schools | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
Audience
| Researchers | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
| National Assessment of… | 2 |
| Armed Services Vocational… | 1 |
| California Achievement Tests | 1 |
| California Psychological… | 1 |
| Graduate Record Examinations | 1 |
| Program for International… | 1 |
What Works Clearinghouse Rating
Peer reviewedPassmore, David Lynn – Journal of Studies in Technical Careers, 1983
Vocational and technical education researchers need to be aware of the uses and limits of various statistical models. The author reviews the Rasch Model and applies it to results from a nutrition test given to student nurses. (Author)
Descriptors: Educational Research, Item Sampling, Nursing Education, Nutrition
Epstein, Kenneth I.; Knerr, Claramae S. – 1976
The literature on criterion referenced testing is full of discussions concerning whether classical measurement techniques are appropriate, whether variance is necessary, whether new indices of reliability are needed, and the like. What appears to be lacking, however, is a clear and simple discussion of why the problems occur. This paper suggests…
Descriptors: Career Development, Criterion Referenced Tests, Item Analysis, Item Sampling
Peer reviewedFrederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978
A set of Tests of Scientific Thinking were developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)
Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity
Molina, Huberto; Shoemaker, David M. – 1977
In discussion of assessment and its applicability to the needs of the Spanish-speaking student population, four types of assessments are presented in the context of the educationally significant testing needed in this area. The focus of this paper is on the use of comprehensive assessment to measure the repertoire of language resources that the…
Descriptors: Bilingual Students, Educational Testing, English (Second Language), Item Sampling
PDF pending restorationEstes, Carole; Estes, Gary D. – 1980
Multiple matrix sampling is a sampling design in which both test items and examinees are randomly sampled from their respective populations. This study was designed to develop and assess a method for computing an estimate of a correlation coefficient when a multiple matrix sampling design is used. The examinee populations included 212 third-grade…
Descriptors: Correlation, Elementary Secondary Education, Evaluation Methods, Grade 3
Wilcox, Rand R. – 1977
Three statistical problems related to criterion-referenced testing are investigated: estimation of the likelihood of a false-positive or false-negative decision with a mastery test, estimation of true scores in the Compound Binomial Error Model, and comparison of the examinees to a control. Two methods for estimating the likelihood of…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error Patterns, Item Sampling
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Fruchter, Dorothy A.; Ree, Malcolm James – 1977
In order to meet the needs of all the Armed Services, new forms of the Armed Services Vocational Aptitude Battery (ASVAB) must periodically be developed, refined, and standardized on an appropriate normative sample. Since one of the uses of the ASVAB is to determine candidate suitability for military service, it is necessary for the…
Descriptors: Aptitude Tests, Armed Forces, Equated Scores, Item Analysis
PDF pending restorationMisanchuk, Earl R. – 1978
Multiple matrix sampling of three subscales of the California Psychological Inventory was used to investigate the effects of four variables on error estimates of the mean (EEM) and variance (EEV). The four variables were examinee population size (600, 450, 300, 150, 100, and 75); number of subtests, (2, 3, 4, 5, 6, and 7), hence the number of…
Descriptors: Adults, Analysis of Variance, Error of Measurement, Item Sampling
Shoemaker, David M. – 1972
The post mortem item-examinee sampling investigation described herein explored the feasibility of using item-examinee sampling to estimate scale values denoting degree of affect toward stimuli when measured by the method of paired-comparisons. Results indicate clearly that such scale values can be approximated satisfactorily through item-examinee…
Descriptors: Attitude Measures, Attitudes, Item Sampling, Mathematical Applications
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
Pandey, Tej N. – 1978
The concept under investigation was the reliability of estimates of mean scores of groups under various assumptions of multiple-matrix sampling when reliabilities are computed according to procedures based on generalizability theory. Four different cases were compared with respect to the generalizability coefficients depending upon whether pupils…
Descriptors: Achievement Tests, Analysis of Variance, Basic Skills, Elementary Secondary Education
Douglass, James B. – 1979
A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to compare five models for test equating: (1) anchor test equating using classical test theory; (2) anchor test equating using the one-parameter logistic…
Descriptors: Comparative Analysis, Equated Scores, Flow Charts, Goodness of Fit
Harris, Chester W.; And Others – 1977
The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…
Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling


