Peer reviewed: Rozeboom, William W. – Applied Psychological Measurement, 1989
Formulas are provided for estimating the reliability of a linear composite of non-equivalent subtests given the reliabilities of component subtests. The reliability of the composite is compared to that of its components. An empirical example uses data from 170 children aged 4 through 8 years performing 34 Piagetian tasks. (SLD)
Descriptors: Elementary School Students, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
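As a rough illustration of the kind of calculation this abstract describes, the sketch below computes the reliability of a weighted composite from subtest reliabilities using the standard classical test theory identity. It assumes measurement errors are uncorrelated across subtests. It is not taken from the article, and the function name `composite_reliability` and the example numbers are hypothetical.

```python
def composite_reliability(weights, cov, reliabilities):
    """Reliability of the composite C = sum_i w_i * X_i.

    weights       -- list of subtest weights w_i
    cov           -- observed covariance matrix of the subtests (list of lists)
    reliabilities -- list of subtest reliabilities rho_i

    Assumption (not from the source): errors are uncorrelated across
    subtests, so the composite error variance is
    sum_i w_i^2 * var_i * (1 - rho_i).
    """
    n = len(weights)
    # Observed variance of the composite: w' * Cov * w
    composite_var = sum(
        weights[i] * weights[j] * cov[i][j]
        for i in range(n)
        for j in range(n)
    )
    # Error variance of the composite under uncorrelated errors
    error_var = sum(
        weights[i] ** 2 * cov[i][i] * (1.0 - reliabilities[i])
        for i in range(n)
    )
    return 1.0 - error_var / composite_var


# Hypothetical example: two unit-variance subtests correlated 0.5,
# each with reliability 0.8, summed with equal weights.
example = composite_reliability([1, 1], [[1.0, 0.5], [0.5, 1.0]], [0.8, 0.8])
print(round(example, 3))  # 0.867
```

In this made-up case the composite (about 0.87) is more reliable than either component (0.80), which is the sort of composite-versus-component comparison the abstract mentions.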
Peer reviewed: Hubbard, J. I.; Seddon, G. M. – Research in Science and Technological Education, 1989
Investigates differences in marking standards and reliability when experienced teachers assessed performance on practical exercises. The results showed no difference between the assessments from the groups containing 5 and 20 students. (Author/YP)
Descriptors: Evaluation Research, Foreign Countries, Group Testing, Science Teachers
Peer reviewed: Weber, Larry – Educational Research Quarterly, 1988
A 34-item instrument for assessing attitudes about merit pay for teachers was developed and administered to 237 teachers and 86 administrators. Two months later, in a replication study, the instrument was administered to 193 teachers, 107 administrators, and 135 parents. The instrument has high reliability. (SLD)
Descriptors: Administrator Attitudes, Administrators, Attitude Measures, Educational Assessment
Peer reviewed: Hakstian, A. Ralph; And Others – Psychometrika, 1988
A model and computation procedure based on classical test score theory are presented for determination of a correlation coefficient corrected for attenuation due to unreliability. Delta and Monte Carlo method applications are discussed. A power analysis revealed no serious loss in efficiency resulting from correction for attenuation. (TJH)
Descriptors: Correlation, Equations (Mathematics), Hypothesis Testing, Mathematical Models
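For readers unfamiliar with the correction mentioned in this abstract, the sketch below shows the classical point estimate of a correlation corrected for attenuation: the observed correlation divided by the square root of the product of the two reliabilities. It does not reproduce the article's model or its delta-method and Monte Carlo procedures. The function name `disattenuated_correlation` and the example values are hypothetical.

```python
import math


def disattenuated_correlation(r_xy, rel_x, rel_y):
    """Classical correction for attenuation: r_xy / sqrt(rel_x * rel_y).

    r_xy  -- observed correlation between measures X and Y
    rel_x -- reliability of X (must be in (0, 1])
    rel_y -- reliability of Y (must be in (0, 1])
    """
    if not (0.0 < rel_x <= 1.0 and 0.0 < rel_y <= 1.0):
        raise ValueError("reliabilities must lie in the interval (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Hypothetical example: observed r = 0.40, reliabilities 0.80 and 0.50.
print(round(disattenuated_correlation(0.40, 0.80, 0.50), 3))  # 0.632
```

With sample estimates the corrected value can exceed 1.0, which is one reason inferential procedures for the corrected coefficient need separate treatment.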
Peer reviewed: Cleary, Christopher – British Journal of Language Teaching, 1988
A comparison of the holistic, error-count, and categorical methods of assessing written work presents disadvantages and advantages in terms of validity, reliability, and efficiency. The results of a pilot study indicate that the error-count method may provide the most benefits. (CB)
Descriptors: English (Second Language), Error Analysis (Language), Evaluation Methods, Holistic Evaluation
Peer reviewed: Fielding, David W.; And Others – American Journal of Pharmaceutical Education, 1994
A test of pharmacists' practice knowledge, for use in professional continuing education, was developed by having practitioners prepare 673 assessment items related to specific competencies. The item pool was refined by psychometric evaluation and content validation. Two parallel tests, each containing 120 items, were pilot-tested with first-year…
Descriptors: Competency Based Education, Higher Education, Knowledge Level, Pharmaceutical Education
Peer reviewed: Douglas, Dan – Annual Review of Applied Linguistics, 1995
Reviews recent theoretical, methodological, and analytical developments in language testing, focusing on more refined models of language ability, reliability and validity, performance testing, innovative test formats, and new applications of Item Response Theory and Generalizability Theory to test performance. An annotated bibliography discusses seven…
Descriptors: Annotated Bibliographies, Evaluation Methods, Language Proficiency, Language Tests
Peer reviewed: Lee, Steven W.; And Others – Behavioral Disorders, 1994
The Child Behavior Checklist and related forms were completed for 171 boys referred for school-based assessment resulting from academic and/or behavioral problems. Adolescents consistently underreported behavioral problems relative to parents and teachers regardless of subsequent diagnosis. Implications of these discrepancies in school-based…
Descriptors: Adolescents, Behavior Problems, Disability Identification, Educational Diagnosis
Peer reviewed: Fantuzzo, John; And Others – Early Childhood Research Quarterly, 1995
A study developed and validated the Penn Interactive Peer Play Scale (PIPPS), a teacher-rating instrument of the interactive play behaviors of preschool children. Thirty-eight teachers completed the measure on 312 African American children enrolled in Head Start. Exploratory factor analysis revealed three reliable underlying dimensions: play…
Descriptors: Behavior Rating Scales, Blacks, Early Childhood Education, Interpersonal Competence
Peer reviewed: Noijons, Jose – CALICO Journal, 1994
Defines computer assisted language testing (CALT), discusses the various processes involved, outlines the advantages and disadvantages, and examines psychometric aspects of computer testing. A table of factors distinguishes between test content and the mechanics of test taking. These factors constitute a table for developing a CALT checklist. (24…
Descriptors: Check Lists, Computer Assisted Testing, Factor Analysis, Feedback
Peer reviewed: Stiles, Joan – Monographs of the Society for Research in Child Development, 1994
Considers the bases of criticism of parent report as an index of their children's behavioral development and ways in which problems associated with parent report were addressed in the construction of the MacArthur Communicative Development Inventories (CDIs). Examines the nature of responses elicited from parents as they complete the CDIs. (BC)
Descriptors: Behavior Development, Body Language, Child Behavior, Data Collection
Peer reviewed: Grant, Carolyn D.; Nash, Michael R. – Psychological Assessment, 1995
In a counterbalanced, within-subjects, repeated-measures design, 130 undergraduates were administered the Computer-Assisted Hypnosis Scale (CAHS) and the Stanford Hypnotic Susceptibility Scale and were hypnotized. The CAHS was shown to be a psychometrically sound instrument for measuring hypnotic ability. (SLD)
Descriptors: Ability, Clinical Diagnosis, Computer Assisted Testing, Diagnostic Tests
Peer reviewed: Beidel, Deborah C.; And Others – Psychological Assessment, 1995
A new instrument, the Social Phobia and Anxiety Inventory for Children (SPAI-C), was developed. Results from 6 studies with nearly 600 children indicate that the SPAI-C is a reliable and valid measure for childhood social anxiety and fear. It may be useful for improving clinical assessment and documenting treatment outcomes. (SLD)
Descriptors: Anxiety, Children, Clinical Diagnosis, Diagnostic Tests
Snyder, Scott; Sheehan, Robert – Diagnostique, 1992
Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)
Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity
Peer reviewed: Kunnan, Antony John – Language Testing, 1992
Three analysis procedures were used to study the dependability and validity of ESLPE, a criterion-referenced English-as-a-Second-Language placement test developed at the University of California at Los Angeles in 1989. Findings led to the suggestion that some students might have been differently placed if subtest scores were used for placement. (38…
Descriptors: Cluster Analysis, Comparative Analysis, Criterion Referenced Tests, English (Second Language)


