Erlich, Oded; Borich, Gary – Journal of Educational Measurement, 1979
Generalizability theory was used to study the occurrence and generalizability of classroom interaction measures. Results indicated infrequent occurrence and lack of generalizability of many behaviors of the Brophy-Good Teacher-Child Dyadic Interaction System. (JKS)
Descriptors: Classroom Observation Techniques, Classroom Research, Elementary Education, Interaction Process Analysis

Korth, Bruce – Journal of Educational Measurement, 1979
Student ratings on general, broad questionnaires about instructors are shown to lack validity to the extent that the ratings can be predicted from irrelevant characteristics, such as characteristics that belong to the student, the particular class, or interactions of the student and the class. (Author/JKS)
Descriptors: College Faculty, College Students, Course Evaluation, Evaluation Criteria

Smith, Jeffrey K.; Krajkovich, Joseph G. – Educational and Psychological Measurement, 1979
The Image of Science and Scientists Scale was developed to assess high school students' attitudes toward science as a field of study. Reliability and three types of validity evidence are reported for the scale. (Author)
Descriptors: Academic Achievement, Attitude Measures, Grade 9, High Schools

Krassowski, Elaine; Plante, Elena – Journal of Communication Disorders, 1997
The practice of cognitive referencing to determine the presence of a specific language impairment (SLI) and eligibility for services is questioned by a study which compared the variability of the IQ scores of children with specific language impairment over time. The study found high IQ variability, suggesting that IQs reflect current abilities…
Descriptors: Academic Aptitude, Cognitive Ability, Cognitive Development, Disability Identification

Popham, W. James – School Administrator, 2003
An authority on student assessment says the ability of public schools to meet federal expectations will depend on the instructional sensitivity of the tests in use. Offers a six-step blueprint for carrying out a public-information campaign about NCLB tests. Lists websites for two reports by five national education associations on instructionally…
Descriptors: Accountability, Elementary Secondary Education, High Stakes Tests, Instructional Improvement

Robinson, Peter – Language Learning, 1997
Examines claims that unconscious second language learning under implicit and incidental conditions is insensitive to measures of individual differences in cognitive abilities, in contrast to learning under conscious rule-search and instructed conditions. Findings revealed that only in the incidental condition was the extent of learning and…
Descriptors: Adult Learning, Adult Students, Classroom Environment, Cognitive Ability

The Use of Pedometry To Evaluate the Physical Activity Levels among Preschool Children in Hong Kong.
Louie, Lobo; Chan, Lily – Early Child Development and Care, 2003
This study used pedometry and the Children Activity Rating Scale (CARS) to investigate physical activity among 3- to 5-year-olds in Hong Kong preschools. Findings indicated that older children were more active than younger ones; boys were more active than girls. Older children in the rural school with larger outdoor play space were more active…
Descriptors: Age Differences, Comparative Analysis, Early Childhood Education, Foreign Countries

Goldstein, Sam – Journal of Autism and Developmental Disorders, 2002
The reliability, validity, and clinical utility of the Asperger Syndrome Diagnostic Scale in the diagnosis of pervasive developmental disorders are reviewed. While the measure holds promise as a research tool, there appears little evidence that it can distinguish among the variety of types of pervasive developmental disorders, or diagnose Asperger…
Descriptors: Asperger Syndrome, Autism, Behavior Rating Scales, Classification

Kolstad, Rosemarie K.; Kolstad, Robert A. – Journal of Dental Education, 1991
A study evaluated the use of a "none-of-these" option on multiple-choice achievement tests in undergraduate dental education. Results indicated this option neither enhanced nor diminished examinee performance stability but did reduce the examinee's opportunity to select correct choices by means unrelated to course objectives, thereby enhancing…
Descriptors: Achievement Tests, Dental Schools, Difficulty Level, Higher Education

Harper, Dennis C.; Wadsworth, John S. – Research in Developmental Disabilities, 1990
This article investigates cognitive decline and depressive symptomatology among older adults with mental retardation. A pilot study of assessment instruments is reported. Findings reveal that decreasing cognitive ability is associated with higher rates of observed depression and reported behavioral problems. Cognitive decline was associated with…
Descriptors: Aging (Individuals), Behavior Problems, Clinical Diagnosis, Cognitive Ability

Read, John – English for Specific Purposes, 1990
Considers the question of how best to elicit samples of writing for assessment in an English-for-academic-purposes proficiency test and assure that every test taker has something to write about. Three types of writing tasks are defined and analyzed, and examples are given. (25 references) (GLR)
Descriptors: English for Academic Purposes, Higher Education, Language Proficiency, Prior Learning

Nist, Sherrie L.; And Others – Reading Research and Instruction, 1990
Investigates the utility and predictive validity of the Learning and Study Strategies Inventory (LASSI) as a means of measuring college students' cognitive and affective growth following a study strategies course. Finds cognitive and affective growth in both regularly admitted and developmental studies students. Finds that LASSI cannot yet be used…
Descriptors: Affective Measures, Cognitive Measurement, College Students, Developmental Studies Programs

Davis, Caroline; Cowles, Michael – Educational and Psychological Measurement, 1989
Computerized and paper-and-pencil versions of four standard personality inventories administered to 147 undergraduates were compared for: (1) test-retest reliability; (2) scores; (3) trait anxiety; (4) interaction between method and social desirability; and (5) preferences concerning method of testing. Doubts concerning the efficacy of…
Descriptors: Comparative Analysis, Computer Assisted Testing, Higher Education, Personality Measures

Chletsos, Peter N.; And Others – Journal of Research and Development in Education, 1989
This article presents evidence of the reliability and validity of a new paper-and-pencil test of proportional reasoning, Paper-and-Pencil Balance Beam Test. A total of 627 individuals, aged 8-47, participated in the 3 studies discussed. Results support previous research which correlates performance on proportional reasoning problems with…
Descriptors: Age Differences, Cognitive Development, Elementary Secondary Education, Formal Operations

Aiken, Lewis R. – Educational and Psychological Measurement, 1989
Two alternatives to traditional item analysis and reliability estimation procedures are considered for determining the difficulty, discrimination, and reliability of optional items on essay and other tests. A computer program to compute these measures is described, and illustrations are given. (SLD)
Descriptors: College Entrance Examinations, Computer Software, Difficulty Level, Essay Tests