Publication Date
| In 2026 | 0 |
| Since 2025 | 433 |
| Since 2022 (last 5 years) | 1911 |
| Since 2017 (last 10 years) | 4483 |
| Since 2007 (last 20 years) | 6968 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 830 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 159 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Lowenthal, Barbara – Academic Therapy, 1989
The special educator must be aware of possible sources of error in assessment of children with learning problems. Sources of error can be attributed to unconscious examiner bias, ambiguous test responses, linguistic and cultural differences of the examiner and examinee, previous test-taking experience, and problems with test reliability and…
Descriptors: Disabilities, Elementary Secondary Education, Evaluation Problems, Experimenter Characteristics
Peer reviewedJaradat, Derar; Tollefson, Nona – Educational and Psychological Measurement, 1988
This study compared the reliability and validity indexes of randomly parallel tests administered under inclusion, exclusion, and correction for guessing directions, using 54 graduate students. It also compared the criterion-referenced grading decisions based on the different scoring methods. (TJH)
Descriptors: Criterion Referenced Tests, Grading, Graduate Students, Guessing (Tests)
Peer reviewedGreenan, James P.; McCabe, Connie C. – Journal of Industrial Teacher Education, 1989
The authors developed, tested, and validated a set of student self-ratings, teacher ratings, and performance assessment instruments designed to measure generalizable reasoning skills of students enrolled in secondary vocational programs. They were found to be sufficiently reliable and valid indicators of functional learning strengths and…
Descriptors: Logical Thinking, Performance Based Assessment, Secondary Education, Self Evaluation (Individuals)
Peer reviewedRoberts, Clare; Pratt, Chris – Australasian Journal of Special Education, 1988
The study evaluated the psychometric properties of reliability and construct validity of the Attitude Toward Mainstreaming Scale (ATMS) in an Australian context. It was concluded that the scale is both reliable and factorially valid in an Australian context. (Author/DB)
Descriptors: Attitude Measures, Cultural Differences, Elementary Secondary Education, Foreign Countries
Peer reviewedStreufert, Siegfried; And Others – Personnel Psychology, 1988
Evaluated quasi-experimental simulation technique designed to measure impact of individual differences in managerial styles on executive performance. Tested 20 simulation-based measures for reliability and validity. Data from two samples suggest that this quasi-experimental simulation technology may be useful in assessing managerial styles not…
Descriptors: Administrator Qualifications, Competence, Evaluation Methods, Individual Differences
Peer reviewedRyser, Gail R. – Journal of Secondary Gifted Education, 1994
The meanings of reliability and validity as they apply to standardized measures are used as a framework for applying the concepts of reliability and validity to authentic assessments. This article sees reliability as scorability and stability, whereas validity is seen as students' ability to use knowledge authentically in the field. (DB)
Descriptors: Elementary Secondary Education, Evaluation Methods, Performance Based Assessment, Reliability
Peer reviewedLewis, Kerry E. – American Journal of Speech-Language Pathology, 1995
An examination of the extent to which scores on the Stuttering Severity Instrument (SSI) for Children and Adults, Third Edition, accurately reflect 10 judges' observations of stuttering behaviors found that SSI scores obscured the wide range of judges' raw counts and did not accurately reflect the observational data from which they were derived.…
Descriptors: Adults, Children, Evaluation Methods, Interrater Reliability
Peer reviewedSzajna, Bernadette – Educational and Psychological Measurement, 1994
Predictive validities of computer aptitude and computer anxiety were studied using nonprogramming computer performance as the criterion variable for 162 young adults. Effects of computer anxiety on performance were negligible, and computer aptitude yielded uncertain results. The measurement instruments' reliability was also reported. (SLD)
Descriptors: Aptitude, Computer Anxiety, Computer Attitudes, Computer Literacy
Peer reviewedMay, Kim; Nicewander, W. Alan – Journal of Educational Measurement, 1994
Reliabilities and information functions for percentile ranks and number-right scores were compared using item response theory, modeling standardized achievement tests. Results demonstrate that situations exist in which the percentage of items known by examinees can be accurately estimated, but the percentage of persons falling below a given score…
Descriptors: Achievement Tests, Difficulty Level, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedSong, Li-yu; And Others – Psychological Assessment, 1994
Measurement fidelity (reliability, factor structure, and validity) of Aschenbach's Youth Self-Report scale was studied with 226 adolescents at a psychiatric hospital. Findings confirm convergent validity and reliability of four of the measure's seven narrowband syndromes, and seven meaningful subdimensions were extracted from the other three…
Descriptors: Adolescents, Factor Analysis, Factor Structure, Measurement Techniques
Peer reviewedReckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
Peer reviewedHigbee, Katherine R.; Roberts, Robert E. – Hispanic Journal of Behavioral Sciences, 1994
Eight-item revision of the UCLA Loneliness Scale was administered to 2,614 students, aged 11-14. Loneliness did not differ by age or between Anglo- and Mexican-American students, but was higher for girls than boys in each ethnic group. Principal components factor analysis and correlations with other related measures indicate good reliability and…
Descriptors: Affective Measures, Anglo Americans, Early Adolescents, Loneliness
Peer reviewedStumpf, Steven H. – Evaluation and the Health Professions, 1994
A five-year curriculum evaluation project is described that treated students' course ratings, examination reliability coefficients, and item-discrimination data as a battery of data points for determining annual revision efforts. Histograms were constructed to make valid demonstrations of successful efforts immediately comprehensible to faculty.…
Descriptors: College Faculty, Comprehension, Curriculum Evaluation, Longitudinal Studies
Peer reviewedAntonak, Richard F.; Larrivee, Barbara – Exceptional Children, 1995
Evidence supporting the use of a revision of the Opinions Relative to Mainstreaming scale, called Opinions Relative to Integration of Students with Disabilities, is presented. Scale testing with 376 professionals revealed satisfactory item characteristics, adequate reliability and homogeneity, and initial support for construct validity. The scale…
Descriptors: Attitude Measures, Disabilities, Elementary Secondary Education, Inclusive Schools
Peer reviewedSimpson, Robert G. – Behavioral Disorders, 1991
The behavior of each of 120 students in grades 9-12 was rated by 2 of the student's teachers using the Revised Behavior Problem Checklist. Results indicated a generally low to moderate degree of relationship among teacher ratings. It is recommended that clinicians collect behavioral ratings from many raters before reaching diagnostic conclusions.…
Descriptors: Behavior Problems, Check Lists, Clinical Diagnosis, Interrater Reliability


