Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Romero, Fernando; Paris, Scott G.; Brem, Sarah K. – Current Issues in Education, 2005
We examined underlying mechanisms for comprehension differences across expository and narrative text while controlling for factors confounded in the extant literature. Fourth grade students (n=32) read both an expository and a narrative text, and completed both a local comprehension assessment, and a global retelling assessment for each text.…
Descriptors: Reading Comprehension, Grade 4, Psycholinguistics, Models
Teachers' Use of Rubrics to Score Non-traditional Tasks: Factors Related to Discrepancies in Scoring
Meier, Sherry L.; Rich, Beverly S.; Cady, JoAnn – Assessment in Education: Principles, Policy and Practice, 2006
This study considered middle school mathematics teachers' use of rubrics to score non-traditional tasks. A group of eighth-grade teachers attended a two-day workshop where they evaluated assessment tasks and discussed the use of an associated scoring rubric. Scored samples of student work submitted by the teachers indicated that they had difficulty…
Descriptors: Mathematics Teachers, Scoring Rubrics, Educational Practices, Knowledge Base for Teaching
Metcalf, Kim K.; Legan, Natalie A. – Journal of School Choice, 2006
School vouchers, particularly since the landmark U.S. Supreme Court ruling on the Cleveland program, are one of the most contentious issues in American education. Seemingly contradictory results across available studies have caused confusion among diverse audiences. The authors suggest that these divergent findings are, in part, due to three…
Descriptors: Educational Vouchers, School Choice, Performance Factors, Research Methodology
Christie, Christina A.; Azzam, Tarek – New Directions for Evaluation, 2005
The purpose of this issue of "New Directions for Evaluation" is to examine, comparatively, the practical application of theorists' approaches to evaluation by examining four evaluations of the same case. The thought is that when asked to evaluate the same program (holding the case constant), the practical distinctions between theorists' approaches…
Descriptors: Theory Practice Relationship, Interrater Reliability, Meta Analysis, Case Studies
Wood, Martin L.; Griffin, Danielle N.; Fredericks, Erika L.; Barrett, Ann C. – American Journal of Health Education, 2003
The International E-Mail Directory of Health Educators (http://www.hedir.siu.edu), or HEDIR, is an electronic mailing list specifically developed to facilitate communication among health education professionals worldwide. The study objectives were to characterize the nature of HEDIR messages and assess how well HEDIR is meeting members'…
Descriptors: Conferences (Gatherings), Check Lists, Electronic Mail, Health Education
Wright, Steven; McNeill, Michael; Fry, Joan; Tan, Steven; Tan, Clara; Schempp, Paul – Journal of Teaching in Physical Education, 2006
This study examined 49 student teachers' actions and perspectives when implementing a curricular innovation (the tactical games approach). Data were collected via videotaped lessons, interviews, and follow-up questionnaires. Questions for interviews and questionnaires were pilot tested and data were analyzed using the constant comparison method.…
Descriptors: Educational Innovation, Student Teachers, Student Teacher Attitudes, Videotape Recordings
Webb, Melvin W., II; Miller, Eva R. – 1995
As constructed-response items become an integral part of educational assessments, setting student performance standards on constructed-response items has become an important issue. Two standard-setting methods, one used for setting standards on the National Assessment of Educational Progress (NAEP) in reading in grade 8 and the other used to set…
Descriptors: Comparative Analysis, Constructed Response, Criteria, Educational Assessment
Goldberg, Gail Lynn; Michaels, Hillary – 1995
Preliminary data was gathered to guide subsequent research that will shape training procedures and scoring practice for performance assessment activities that integrate multiple content areas. Content area integration is a key feature of many of the tasks in the Maryland School Performance Assessment Program (MSPAP), a large-scale assessment of…
Descriptors: Elementary Education, Evaluators, Grade 3, Grade 8
Livingston, Samuel A.; Sims-Gunzenhauser, Alice – 1995
A study was conducted to provide information for setting two separate standards, the accuracy score and the documentation score, for the Praxis III: Classroom Performance Assessment (Praxis III). Praxis III is intended for making instructional and licensing decisions about beginning teachers. This standard-setting study was a person-judgment…
Descriptors: Beginning Teachers, Classroom Observation Techniques, Documentation, Elementary Secondary Education
Angoff, William H. – 1989
This study was undertaken to test the hypothesis that items of the Test of English as a Foreign Language (TOEFL) containing reference to American people, places, customs, etc., tend to favor examinees who have spent some time living in the United States. Two samples of examinees were drawn from the March 1987 TOEFL administration, one tested in…
Descriptors: Context Effect, English (Second Language), Evaluators, Foreign Nationals
Myford, Carol; And Others – 1993
The Educational Testing Service is developing a new generation of teacher assessments--the Praxis Series: Professional Assessments for Beginning Teachers. The assessment series consists of three components. An academic skills component will assess the candidate's basic academic and enabling skills. Subject assessments will test the candidate's…
Descriptors: Beginning Teachers, Classroom Techniques, Cultural Differences, Data Collection
Kaiser, Paul D.; Brull, Harry – 1994
The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…
Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation
Joines, Richard C. – 1991
The development and validation of the General Management In-Basket (GMIB) is described. The GMIB is a theory-based generic in-basket simulation, designed to assess supervisory and management skills independent of any job classification. Three of the 15 in-basket items in the GMIB are critical and are scored on a 0-5 scale. The remaining 12 items…
Descriptors: Administrator Evaluation, Concurrent Validity, Factor Analysis, Interrater Reliability
Freeman, Donald J.; And Others – 1983
Earlier content analyses showed that the match between content covered by textbooks and tests varied as a function of the particular textbook and test a teacher was asked to use. This study tried to determine if the congruity in textbook-test content varied as a function of different styles of textbook use. Using year-long case studies of seven…
Descriptors: Classroom Techniques, Content Analysis, Grade 4, Intermediate Grades
Nelsen, Edward A.; Ray, William J. – 1983
The investigation examined relationships among scales for observing and rating teacher performance. Beginning teachers with varying levels of professional experience (2, 9, and 16 months) were rated by pairs of observers on two occasions. Intercorrelations across occasions fell between .5 and .8. Interrater agreement ranged between .5 and .9.…
Descriptors: Beginning Teachers, Correlation, Data Collection, Elementary School Teachers

