Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Debate Philosophy Statements as Predictors of Critic Attitudes: A Summary and Direction of Research.
Dudczak, Craig; Day, Donald – 1991
Philosophy statements have been used in the National Debate Tournament (NDT) since the mid-1970s and the Cross Examination Debate Association (CEDA) National Tournament since its 1986 inception. The statements should help debaters adapt to critics' expressed preferences. Moreover, philosophy statements can guide the study of argumentation theory…
Descriptors: Comparative Analysis, Content Analysis, Debate, Higher Education
Brown, William L.; Stevens, Betty L. – 1992
The objectives of this study were to determine whether student writing portfolios could be rated reliably by trained judges; study the effects on student ratings of the differential leniency of the judges; and ascertain the effects of writing-prompt difficulty and its interactions with rater leniency. Writing samples from 127 students in grades 3,…
Descriptors: Elementary Education, Evaluation Methods, Interrater Reliability, Judges
Barnwell, David – 1989
A study addressed issues of concern in the use of the American Council on the Teaching of Foreign Languages (ACTFL)/Educational Testing Service (ETS) Language Proficiency Guidelines commonly used in determination of oral language proficiency. Specifically, potential discrepancies between the judgments of trained raters and "naive" native…
Descriptors: Interrater Reliability, Interviews, Language Proficiency, Language Tests
Woolever, Roberta – 1990
A protocol for analyzing the activity structure of a classroom lesson was developed and field tested. The protocol can be completed on site by a person serving as both observer and analyst. Validity of the protocol was established by reference to the large body of qualitative research on activity structure and the analysis of teaching. The…
Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Evaluators, Graduate Students
Johnson, Helen L.; Rosen, Tove S. – 1986
The study compared maternal and trained observer evaluations of infant temperamental characteristics, to determine how closely the ratings correspond, and to analyze the impact of maternal drug abuse habits on maternal ratings of infant temperament. In relating observer to maternal ratings of infant temperament, seven dimensions were compared:…
Descriptors: Child Rearing, Drug Abuse, Infant Behavior, Infants
Paden, Patricia A. – 1986
Two factors which may affect the ratings assigned to an essay test are investigated: (1) context effects; and (2) score level effects. Context effects exist in essay scoring if an essay is rated higher when preceded by poor quality essays than when preceded by high quality essays. A score level effect is defined as a change in the score (value)…
Descriptors: Context Effect, Essay Tests, Holistic Evaluation, Interrater Reliability
Erwin, T. Dary – 1988
Rating scales are a typical method for evaluating a student's performance in outcomes assessment. The analysis of the quality of information from rating scales poses special measurement problems when researchers work with faculty in their development. Generalizability measurement theory offers a set of techniques for estimating errors or…
Descriptors: Educational Assessment, Generalizability Theory, Higher Education, Institutional Research
Plake, Barbara S.; And Others – 1989
The accuracy of standards obtained from judgmental methods is dependent on the quality of the judgments made by experts throughout the standard setting process. One important dimension of the quality of these judgments is the consistency of the judges' perceptions with item performance of minimally competent candidates. Several interrelated…
Descriptors: Cutting Scores, Evaluation Methods, Evaluative Thinking, Evaluators
Lehmann, Rainer H. – 1987
A total of 1,487 eleventh grade students from the Hamburg (West Germany) school system were asked to complete four writing assignments used in an International Association for the Evaluation of Educational Achievement (IEA) study of writing assessment. In analyzing the writing samples, the study focused on: (1) between-rater effects; (2)…
Descriptors: Evaluation Problems, Foreign Countries, High Schools, International Programs
Cooper, Harris M. – 1985
A taxonomy for literature reviews in education and psychology is presented. The increased use of the descriptor "literature review" in ERIC and Psychological Abstracts documents between 1969 and 1983 is cited as creating the need for categorization. The taxonomy categorizes reviews according to focus, goal, perspective, coverage,…
Descriptors: Classification, Content Analysis, Databases, Educational Research
Congleton, Donna McKinley – 1982
A scale was developed and tested to measure metaphoric complexity in order to aid teachers in the selection and sequencing of young adult (YA) novels in the English curriculum. The development and testing of the scale involved two stages: (1) the development of a questionnaire to determine if differences in the metaphoric complexity of examples…
Descriptors: Adolescent Literature, Content Analysis, Difficulty Level, English Curriculum
Peer reviewedGjessing, Hans-Jorgen – Scandinavian Journal of Educational Research, 1986
The terms reading disability and dyslexia are discussed, as well as the meaning of function analysis as a way of diagnosing behavior and difficulties in reading and spelling. The author's model classifies reading disability and dyslexia as auditory, auditory-visual, visual, emotional, or pedagogic. The Bergen Study is described. (Author/LMO)
Descriptors: Auditory Discrimination, Dyslexia, Elementary Education, Foreign Countries
Peer reviewedMills, Janet – Bulletin of the Council for Research in Music Education, 1987
Questions the extent to which assessment of solo musical performance can be made under the General Certificate of School Education exam in England and Wales. Discusses performances as criterion. Reports on experiment which attempted to assess a student's overall music performance. Offers a model which can be used to better measure solo music…
Descriptors: Educational Research, Educational Testing, Foreign Countries, Interrater Reliability
Peer reviewedPye, Clifton; And Others – Journal of Child Language, 1988
Analysis and comparison of three independent transcriptions of the same speech sample collected from a hearing child with deaf parents resulted in two descriptions of the child's phonological system--one based on a liberal estimate and the other a conservative estimate of the potential error in the transcripts. (Author/CB)
Descriptors: Child Language, Comparative Analysis, Deafness, Error Analysis (Language)
Peer reviewedKilpatrick, Doreen; Duncan, Pam – Child Study Journal, 1985
Examines effect of sex of rater on the interaction and degree of consensus obtained in ratings of child behavior. Children between the ages of 6 and 13 in a residential treatment center were rated on two rating scales. Explanations of results are offered in terms of sex role expectations and stereotyping. (Author/DST)
Descriptors: Adolescents, Behavior Problems, Behavior Rating Scales, Children


