Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 563 |
| Since 2007 (last 20 years) | 1786 |
Descriptor
| Statistical Analysis | 2533 |
| Reliability | 1278 |
| Test Reliability | 1074 |
| Foreign Countries | 940 |
| Correlation | 633 |
| Test Validity | 630 |
| Factor Analysis | 559 |
| Validity | 508 |
| Questionnaires | 479 |
| Measures (Individuals) | 411 |
| Test Construction | 338 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 12 |
| Price, Gary G. | 12 |
| Tindal, Gerald | 10 |
| Lai, Cheng-Fei | 9 |
| Brennan, Robert L. | 8 |
| Raykov, Tenko | 8 |
| Feldt, Leonard S. | 7 |
| Livingston, Samuel A. | 7 |
| Park, Bitnara Jasmine | 7 |
| Irvin, P. Shawn | 6 |
| Anderson, Daniel | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 34 |
| Practitioners | 21 |
| Teachers | 10 |
| Students | 8 |
| Administrators | 5 |
| Counselors | 2 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 204 |
| Nigeria | 57 |
| Jordan | 38 |
| Australia | 35 |
| Iran | 35 |
| Taiwan | 35 |
| Canada | 31 |
| China | 30 |
| Germany | 29 |
| California | 28 |
| United Kingdom | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Crocker, Linda; Algina, James – 1986
This text was written to help the reader acquire a base of knowledge about classical psychometrics and to integrate new ideas into that framework of knowledge. The material is organized into five units: (1) introduction to measurement theory; (2) reliability; (3) validity; (4) item analysis in test development; and (5) test scoring and…
Descriptors: Item Analysis, Measurement Techniques, Psychometrics, Scoring
Krakower, Jack – 1987
The validity and reliability of aggregating individuals' perceptions of college organizational characteristics to more macro units of analysis were examined. The focus was the extent to which data collected in a national study met the internal consistency criterion for aggregating perceptual data to the institution level. The study assessed key…
Descriptors: Administrator Attitudes, College Environment, Data Analysis, Higher Education
Shaver, James P. – 1984
The major finding of the Law-Related Education Evaluation Project report for Year 1 (1981), that law-related education courses can reduce juvenile delinquency, is of limited use to educational decision makers and could be misleading. The research design leaves much to be desired; however, that fact must be considered in light of the difficulty of…
Descriptors: Delinquency, Educational Assessment, Legal Education, Outcomes of Education
Moore, R. P.; Shah, B. V. – 1975
The average design effects for statistics estimated from the base-year National Longitudinal Study data are presented. Attempts to partition the effects into those due to stratification, clustering, and unequal weighting are discussed. The expected increases in subpopulation sample sizes due to oversampling are calculated and compared with the…
Descriptors: Efficiency, Graduate Surveys, High School Graduates, High Schools
Smith, Douglas U. – 1979
The ratings of 43 third-year medical students were used to investigate a method of computing the reliability of ratings. The students were evaluated on a four-point rating scale along seven dimensions of clinical ability: diagnostic ability; motivation; sense of responsibility; oral-verbal ability; effectiveness with patients; ability to take a…
Descriptors: Analysis of Variance, Computer Programs, Higher Education, Informal Assessment
Benson, Jeri – 1979
Two methods of item selection were used to select sets of 40 items from a 50-item verbal analogies test, and the resulting item sets were compared for relative efficiency. The BICAL program was used to select the 40 items having the best mean square fit to the one parameter logistic (Rasch) model. The LOGIST program was used to select the 40 items…
Descriptors: Comparative Analysis, Computer Programs, Costs, Efficiency
Darnell, Donald K. – 1968
This final report presents a description of a test combining cloze procedure and an entropy analysis (CLOZENTROPY), designed to measure the compatibility of a foreign student's English with that of his peers who are native speakers of English. This test, and the Test of English as a Foreign Language (TOEFL) were administered to 48 foreign students…
Descriptors: Cloze Procedure, College Students, English (Second Language), Foreign Students
Cross, Lawrence H. – 1975
A novel scoring procedure was investigated in order to obtain scores from a conventional multiple-choice test that would be free of the guessing component or contain a known guessing component even though examinees were permitted to guess at will. Scores computed with the experimental procedure are based not only on the number of items answered…
Descriptors: Algebra, Comparative Analysis, Guessing (Tests), High Schools
Tracy, D. B.; And Others
Responses on both the state and trait scales of the State-Trait Anxiety (STAI) Inventory were examined under two conditions. The first condition presented a simulated real-life situation containing competitive and evaluative cues without directly suggesting faking and asked subjects to complete the STAI. After an intervening task, the STAI was…
Descriptors: Anxiety, College Students, Psychological Patterns, Response Style (Tests)
Morse, David T.; Morse, Linda W. – 1976
Performance testing often entails the usage of expensive, time-consuming measures in the quest for determining the level of performance on some desired behavior. It is concluded that a generalizability theory approach to dealing with departures from reality in testing can aid in the establishment of empirically-based choices of measurement…
Descriptors: Cost Effectiveness, Decision Making, Mathematical Models, Measurement Techniques
Simon, Alan J.; Joiner, Lee M. – 1974
The effectiveness of test adaptation based on item selection and reordering of a Spanish (Mexican) version of the Peabody Picture Vocabulary Test (PPVT) was examined. Translated forms were administered to a sample of Mexican students. One item from each pair (A and B) was selected and reordered using a priori rules. The revised instrument was…
Descriptors: Culture Fair Tests, Elementary Education, Elementary School Students, Intelligence Tests
Smith, John E.; And Others
The purpose of this study was to construct and validate a self-rating scale which can be easily administered and quickly scored for discriminating between those high school students who will dropout and those who will not. A 34-item scale was constructed for subjects with a fifth grade reading level. This scale was administered to 113 high school…
Descriptors: Dropout Characteristics, Dropout Research, Dropouts, High School Students
PDF pending restorationMcMorris, Robert F. – 1971
The extent of error likely to occur with each of several approximations for the standard deviation, internal consistency reliability, and the standard error of measurement is analyzed. Approximations were compared with exact statistics obtained on 85 different classroom tests constructed and administered by professors in a variety of fields. Means…
Descriptors: Data Analysis, Error of Measurement, Evaluation Methods, Item Analysis
Bowers, John; Loeb, Jane – 1971
Multiple regression equations predicting first semester grade point average (GPA) from high school percentile rank (HSPR) and Composite score on the American College Test (ACT:C) were examined for five successive fall freshman classes (1965-1969) at the University of Illinois. Slopes were significantly different among the five equations. Further…
Descriptors: Academic Achievement, Academic Standards, Aptitude Tests, College Admission
Bayuk, Robert J. – 1973
An investigation was conducted to determine the effects of response-category weighting and item weighting on reliability and predictive validity. Response-category weighting refers to scoring in which, for each category (including omit and "not read"), a weight is assigned that is proportional to the mean criterion score of examinees selecting…
Descriptors: Aptitude Tests, Correlation, Predictive Validity, Research Reports


