Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 563 |
| Since 2007 (last 20 years) | 1786 |
Descriptor
| Statistical Analysis | 2533 |
| Reliability | 1278 |
| Test Reliability | 1074 |
| Foreign Countries | 940 |
| Correlation | 633 |
| Test Validity | 630 |
| Factor Analysis | 559 |
| Validity | 508 |
| Questionnaires | 479 |
| Measures (Individuals) | 411 |
| Test Construction | 338 |
Author
| Alonzo, Julie | 12 |
| Price, Gary G. | 12 |
| Tindal, Gerald | 10 |
| Lai, Cheng-Fei | 9 |
| Brennan, Robert L. | 8 |
| Raykov, Tenko | 8 |
| Feldt, Leonard S. | 7 |
| Livingston, Samuel A. | 7 |
| Park, Bitnara Jasmine | 7 |
| Irvin, P. Shawn | 6 |
| Anderson, Daniel | 5 |
Audience
| Researchers | 34 |
| Practitioners | 21 |
| Teachers | 10 |
| Students | 8 |
| Administrators | 5 |
| Counselors | 2 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 204 |
| Nigeria | 57 |
| Jordan | 38 |
| Australia | 35 |
| Iran | 35 |
| Taiwan | 35 |
| Canada | 31 |
| China | 30 |
| Germany | 29 |
| California | 28 |
| United Kingdom | 25 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Levine, Michael V. – 1976
It is shown that empirical mental test P-P plots are approximately equal to theoretical item-item curves, at least for long tests administered to many people. This result is important because it leads to (1) a distribution-free method for estimating points on item-item curves; (2) a general method for defining estimates of item parameters; and…
Descriptors: Item Analysis, Latent Trait Theory, Mathematical Applications, Mathematical Models
Cox, Richard C. – 1965
The validity of an educational achievement test depends upon the correspondence between specified educational objectives and the extent to which these objectives are measured by the evaluation instrument. This study is designed to evaluate the effect of statistical item selection on the structure of the final evaluation instrument as compared with…
Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis
Nuss, Eugene M.; Rookey, Ernest J. – 1966
This study was phase 1 of a 2-phase project designed to change teacher behavior with the objective of improved classroom utilization of new instructional media. Phase 1 tried to increase teacher knowledge of media via a monthly newsletter, "The Circulator," distributed in 5 dissemination patterns to 2,200 educators in Pennsylvania. A…
Descriptors: Audiovisual Instruction, Behavior Change, Educational Media, Experiments
McDermott, Paul A.; Watkins, Marley W. – 1979
A computer program named Program STANDARD is presented and demonstrated. This program calculates the statistical significance of the overall agreement of the categorical assignments. The program is based on Light's statistic, G, for describing the conjoint agreement of many observers with a correct or standard set of classifications on nominal…
Descriptors: Classification, Computer Programs, Goodness of Fit, Nonparametric Statistics
Trzasko, Joseph A. – 1975
This paper describes the proposed Assessment Center at Mercy College in Dobbs Ferry, New York. The center is intended to provide statistical and technical support for the Mercy College elementary education, special education, and speech and hearing departments in the areas of student assessment, student guidance, and program evaluation. Evaluation…
Descriptors: Competency Based Teacher Education, Educational Assessment, Evaluation, Pretesting
Lovett, Hubert T. – 1975
The reliability of a criterion-referenced test was defined as a measure of the degree to which the test discriminates between an individual's level of performance and a predetermined criterion level. The variances of observed and true scores were defined as the squared deviation of the score from the criterion. Based on these definitions and the…
Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Mathematical Models
Andrulis, Richard S.; And Others – 1974
The purpose of this investigation was to establish the effects of repeaters on test equating. Since consideration was not given to repeaters in test equating, such as in the derivation of equations by Angoff (1971), the hypothetical effect needed to be established. A case study was examined which showed results on a test as expected; overall mean…
Descriptors: Cutting Scores, Equated Scores, Recall (Psychology), Retention (Psychology)
Larsson, Bernt – Didakometry, 1974
Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…
Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses
Smith, Donald M. – 1974
The concept of scaled achievement tests is discussed and a method of selecting those items of a test that form the most scalable (i.e., having the highest coefficient of reproducibility) subset is presented. Sometimes called a monotonic-deterministic model, this type of test assumes that the test items may be sequentially ordered. To determine the…
Descriptors: Achievement Tests, Arithmetic, Difficulty Level, Item Analysis
Brennan, Robert L. – 1974
An attempt is made to explore the use of subjective probabilities in the analysis of item data, especially criterion-referenced item data. Two assumptions are implicit: (1) one wants to obtain a maximum amount of information with respect to an item using a minimum number of subjects; and (2) once the item is validated, it may well be administered…
Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Item Analysis
Rim, Eui-Do; Bresler, Samuel – 1974
Livingston's reliability coefficients and Harris' indices of efficiency were computed along with the classical internal consistency coefficients, KR-20's (Kuder-Richardson internal consistency coefficient), for 678 criterion-referenced tests in the A through E levels of an individualized mathematics program. The coefficients were carefully studied…
Descriptors: Academic Achievement, Correlation, Criterion Referenced Tests, Elementary School Mathematics
Lapp, Diane – 1970
The Behavioral Objectives Writing Skills Test (BOWST) was designed to provide an estimate of the elementary teacher's ability to write behavioral objectives. This instrument, which requires the teacher to develop three behavioral objectives for each of four hypothetical classroom settings, has wide utility as a teacher-training tool. It may be…
Descriptors: Behavioral Objectives, Elementary Education, Higher Education, Scoring
Gable, Robert K.; Roberts, Arthur D.
The development and preliminary validation of an instrument to measure attitude toward school subjects (GRASS) is described. An item pool consisting of 30 items was generated and refined. A 23-item Likert scale was then administered to 893 eleventh- and twelfth-grade high school subjects. A principal component analysis and obliquimax transformation…
Descriptors: Content Analysis, Grade 11, Grade 12, Measurement Instruments
Tucker, Ledyard R.; And Others – 1970
Three topics in factor analysis are covered: a) a reliability coefficient for assessing the quality of a maximum likelihood factor analysis, b) an application of three-mode factor analysis to serial learning data, showing variations in learning curves over stages of learning and individuals, and c) the use of personal probability functions to…
Descriptors: Correlation, Factor Analysis, Hypothesis Testing, Individual Differences
Stanley, Julian C.; Livingston, Samuel A. – 1971
Besides the ubiquitous Pearson product-moment r, there are a number of other measures of relationship that are attenuated by errors of measurement and for which the relationship between true measures can be estimated. Among these are the correlation ratio (eta squared), Kelley's unbiased correlation ratio (epsilon squared), Hays' omega squared,…
Descriptors: Analysis of Variance, Cluster Grouping, Correlation, Data Analysis


