Publication Date
In 2025 | 205 |
Since 2024 | 705 |
Since 2021 (last 5 years) | 2293 |
Since 2016 (last 10 years) | 4594 |
Since 2006 (last 20 years) | 6899 |
Descriptor
Test Reliability | 14762 |
Test Validity | 9771 |
Test Construction | 4248 |
Foreign Countries | 3657 |
Psychometrics | 2361 |
Factor Analysis | 2251 |
Measures (Individuals) | 1717 |
Evaluation Methods | 1401 |
Higher Education | 1384 |
Correlation | 1234 |
Questionnaires | 1228 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 452 |
Practitioners | 319 |
Teachers | 128 |
Administrators | 73 |
Policymakers | 33 |
Counselors | 31 |
Students | 17 |
Parents | 10 |
Community | 6 |
Support Staff | 5 |
Location
Turkey | 797 |
Australia | 236 |
Canada | 205 |
China | 195 |
Indonesia | 142 |
Spain | 124 |
United States | 121 |
United Kingdom | 117 |
Germany | 106 |
Taiwan | 103 |
Netherlands | 99 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 2 |
Does not meet standards | 1 |
Breland, Hunter M.; And Others – 1987
Six university English departments collaborated in this examination of the differences between multiple-choice and essay tests in evaluating writing skills. The study also investigated ways the two tools can complement one another, ways to improve cost effectiveness of essay testing, and ways to integrate assessment and the educational process.…
Descriptors: Comparative Testing, Efficiency, Essay Tests, Higher Education
Mason, Jana; And Others – 1986
Two contrasting kindergarten reading programs (book-focused and letter-focused) were chosen for a study that evaluated the ability of the Early Reading Test to probe children's knowledge of stories as well as letters, sounds, and words. The test also evaluated the kinds of strategies children use to attempt reading tasks and, through interview…
Descriptors: Beginning Reading, Comparative Analysis, Early Reading, Kindergarten Children
American Coll. Testing Program, Iowa City, IA. – 1981
UNIACT, a major component of the American College Testing (ACT) Assessment Program, is one of the first interest inventories to employ a new technique for ensuring sex fairness in the reporting of scores. UNIACT was constructed with the goal that distributions of career options suggested to males and females would be similar. It is intended to…
Descriptors: Adults, Career Planning, Interest Inventories, Minority Groups
Obrzut, John E. – 1982
This paper reviews the literatuare on projective techniques of personality assessment and their use by school psychologists. Following a brief survey of the development of projective techniques, several of the most widely used techniques are briefly discussed, i.e., the Thematic Apperception Test (TAT), the Childrens Apperception Test (CAT), the…
Descriptors: Counselor Role, Elementary Secondary Education, Evaluation Methods, Literature Reviews
Fraser, Barry J.; Fisher, Darrell L. – 1983
This manual makes accessible several widely used instruments for measuring perceptions of psychosocial characteristics of classroom environment among school students and teachers. Background information, scoring procedures, validation data, and preferred and short forms of the Learning Environment Inventory, My Class Inventory, Classroom…
Descriptors: Attitude Measures, Classroom Environment, Curriculum Evaluation, Outcomes of Education
Jones, Eric D.; And Others – 1983
The purpose of this study was to evaluate the utility of out-of-level testing (OLT) when it is applied to the assessment of special education students with mild learning handicaps. This evaluation of OLT involved testing hypotheses related to: (1) the adequacy of vertical scaling, (2) the reliability and (3) the validity of OLT scores. Fifty-eight…
Descriptors: Educational Diagnosis, Error of Measurement, Guessing (Tests), Intermediate Grades
Smith, Brandon B.; And Others – 1982
A project was conducted to establish the validity of a procedure and set of instruments to facilitate a more systematic approach for the continued staff development of inservice secondary and postsecondary vocational instructors in three states (Kentucky, Virginia, and Wisconsin) with different delivery systems. The population consisted of a…
Descriptors: Evaluation Criteria, Evaluation Methods, Inservice Teacher Education, Needs Assessment
Cuttance, Peter F. – 1982
Covariance structure modelling is applied to the problem of estimating reliability and measurement error in survey data. To provide a basis for grouping certain question or variable types (data from questions), a simple typology based on the formal characteristics of the questions is outlined. From this classification, models for the different…
Descriptors: Analysis of Covariance, Correlation, Educational Research, Error of Measurement
Mislevy, Robert J.; And Others – 1982
An approach was developed based on item-response models defined at the level of salient subject groups rather than at the level of individuals, designed for use with multiple-matrix sampling designs. In each of three National Assessment of Educational Progress (NAEP) mathematics subtopics, Reiser's group-effects latent trait model was fitted to…
Descriptors: Educational Assessment, Item Analysis, Item Sampling, Latent Trait Theory
Newburger, Craig Alan – 1982
A study was conducted to test four hypotheses concerning modification of student self-concept in communication courses: (1) different kinds of training affect student self-concept in different ways, (2) scale bias affects measurement of student self-concept, (3) male and female self-concepts change differently, and (4) course grade affects student…
Descriptors: Attitude Change, College Students, Communication Apprehension, Communication Research
Massachusetts State Dept. of Education, Boston. Bureau of Research and Assessment. – 1982
Since the approval of the Basic Skills Improvement Policy in 1978, the Massachusetts Department of Education has been developing tests and alternative forms for the assessment of student achievement in five basic skills content areas: reading, writing, mathematics, listening, and speaking. Because of the lack of previous research on which to draw…
Descriptors: Achievement Tests, Basic Skills, Elementary Secondary Education, Policy Formation
Illback, Robert; And Others – 1982
Early identification of children at risk for various forms of school maladaptation is critical in rural schools, where services and resources are typically limited. The present study assesses the psychometric characteristics and utility of the AML, a teacher rating scale employed in a rural region. The 11-item teacher scale yields 4 scores:…
Descriptors: Age Differences, Early Identification, Elementary School Students, High Risk Students
Cashin, William E. – 1985
There are four reasons why comparative data is needed for student ratings of faculty performance: (1) the considerable inflation of student ratings; (2) the great variability in the way students rate different items; (3) because student rating systems must be flexible and comparable; and (4) because of factors (such as student motivation, class…
Descriptors: College Faculty, College Students, Comparative Analysis, Higher Education
Green, Kathy E.; Stager, Susan F. – 1985
This paper reports the development and testing of measures of teachers' attitudes toward testing and appropriate use of tests. A random sample of 555 practicing teachers in Wyoming were surveyed by mail (81 percent response rate). Five subscales assessing attitudes toward use of classroom and standardized tests were identified: (1) standardized…
Descriptors: Attitude Measures, Elementary Secondary Education, Factor Analysis, Standardized Tests
Interrater Reliability and Internal Consistency of Student and Staff Ratings of Medical Instruction.
Dielman, T. E.; Horvatich, Paula K. – 1985
The purposes of this study were to establish the interrater reliability, dimensionality, and internal consistency of an instruction evaluation instrument used at The University of Michigan Medical School. Using the nine-item rating scale, 1,758 student ratings and 88 staff ratings were gathered on 61 faculty. Interrater agreement ranged from .28…
Descriptors: Evaluation Methods, Graduate Medical Education, Higher Education, Interrater Reliability