Publication Date
| In 2026 | 2 |
| Since 2025 | 454 |
| Since 2022 (last 5 years) | 1933 |
| Since 2017 (last 10 years) | 4505 |
| Since 2007 (last 20 years) | 6990 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 837 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 161 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Educational Testing Service, Princeton, NJ. – 1973
A filmstrip with associated audio track has been developed to cover the major planning steps in the development of a measurement instrument such as a test or questionnaire. The filmstrip addresses the following six questions: Why am I testing? What should I test? Whom am I testing? What kinds of questions should I use? How long should my test be?…
Descriptors: Criterion Referenced Tests, Filmstrips, Guides, Instructional Films
Friedman, Martin R.; And Others – 1974
The present study attempted to modify the latencies and errors of adult women on the Matching Familiar Figures test (MFF) by systematically altering task instructions. The results indicated that latencies of impulsive subjects could be altered with "reflective" instructions, while the latencies of reflective subjects were resistent to…
Descriptors: Adults, Cognitive Processes, Females, Individual Differences
Stocking, Martha; And Others – 1973
For two tests measuring the same trait, the program, BIV20, equates the scores using the two True score distributions estimated by the univariate method 20 program (see Wingersky, Lees, Lennon, and Lord, 1969) and, with these equated true scores and their distributions, estimates the bivariate distribution scores and the relative efficiency of the…
Descriptors: Computer Programs, Equated Scores, Statistical Analysis, Test Reliability
Cory, Charles H.; And Others – 1973
The Recruit Performance Test (RPT) was developed to meet a widely recognized need for an accurate measurement of achievement in learning the military/psychomotor skills in recruit training. It was hoped that the test would be of special value in assessing the achievement of Category IV personnel, who are thought to be penalized by the present…
Descriptors: Achievement Tests, Military Training, Performance Tests, Psychomotor Skills
Woodson, M. I. Charles E.
The item (difficulty and discrimination) and test (reliability and validity) statistics in classical test theory are highly dependent upon the calibration sample of individuals used. The estimates of item and test parameters in classical test theory is valid within a range of interest along the characteristic measured. Generally, this range of…
Descriptors: Criterion Referenced Tests, Item Analysis, Research Reports, Statistics
Mendro, Robert – 1971
A major problem in the research concerning distributional and other properties of reliability coefficients has been the non-existence or inaccessibility of adequate test data for use in empirical verification of hypothetical conclusions. The purpose of this paper is to develop a technique for the simulation of test item scores through the use of…
Descriptors: Computer Programs, Factor Analysis, Models, Reliability
Pellegrine, R. J. – 1970
The Diagnostic Reading Tests were designed to assess the reading skills of college students enrolled in reading centers. To assess the reliability of the Diagnostic Reading Tests, Survey Section, Form E (DRTE), a study was conducted with university freshmen as subjects. The DRTE was administered to 31 students in an Educational Opportunity Program…
Descriptors: College Freshmen, Disadvantaged Youth, Reading Centers, Reading Diagnosis
Toole, Patrick F.; And Others – 1970
Section 3 of Phase II of the Pennsylvania Plan is concerned with the adequacy of the educational quality assessment instruments. An overall discussion of reliability--content, criterion related, and construct--is presented. Reliability coefficients for the assessment inventories are provided and empirical studies of validity are described. (PR)
Descriptors: Educational Quality, Measurement Instruments, Questionnaires, Reliability
Lord, Frederic M. – 1972
The stepped-up reliability coefficient does not have the same standard error as an ordinary correlation coefficient. Fisher's Z -transformation should not be applied to it. Appropriate procedures are suggested. (Author)
Descriptors: Analysis of Variance, Mathematical Models, Research, Research Reports
Mandeville, Garrett K. – 1973
An investigation is conducted which presents extensive Monte Carlo results which indicate the conditions under which a procedure using the F distribution can be used to study the robustness of the confidence interval procedures for small samples. A review of the literature is presented. Procedure uses a binary data matrix. Results indicate that…
Descriptors: Confidence Testing, Item Sampling, Literature Reviews, Monte Carlo Methods
PDF pending restorationKristof, Walter – 1973
This study in parametric test theory deals with the statistics of reliability estimation when scores on two parts of a test follow a binormal distribution with equal (case 1) or unequal (case 2) expectations. In each case biased maximum-likelihood estimators of reliability are obtained and converted into unbiased estimators. Sampling distributions…
Descriptors: Expectation, Research Reports, Sample Size, Sampling
Werts, Charles E.; Linn, Robert L. – 1972
Given multiple independent measures of an underlying true factor and information on group membership, it is possible to compute a set of observed group means for each measure. Given at least three tests, these sets of means may be used to compute the reliability of the means for each test. The procedure for estimating true scores from the…
Descriptors: Factor Analysis, Mathematical Models, Research, Research Reports
Reilly, Richard R.; Jackson, Rex – 1972
Item options of shortened forms of the Graduate Record Examination Verbal and Quantitative tests were empirically weighted by two variants of a method originally attributed to Guttman. The first method assigned to each option of an item the mean standard score on the remaining items of all subjects choosing that option. The second procedure…
Descriptors: Correlation, Factor Analysis, Graduate Study, Scoring
Mandeville, Garrett K.
Results of a comparative study of F and Q tests, in a randomized block design with one replication per cell, are presented. In addition to these two procedures, a multivariate test was also considered. The model and test statistics, data generation and parameter selection, results, summary and conclusions are presented. Ten tables contain the…
Descriptors: Comparative Analysis, Data Analysis, Mathematical Models, Models
Garvin, Alfred D.
Confidence weighting (CW) tends to improve the reliability of easy tests; the Coombs-type multiple-response (MR) option tends to improve the reliability of hard tests. It was hypothesized that, on a test of moderate difficulty, offering both the CW and MR response options would improve reliability more than either alone. Twenty-four subjects took…
Descriptors: Confidence Testing, Educational Testing, Multiple Choice Tests, Response Style (Tests)


