Reed, Deborah K.; Sturges, Keith M. – Remedial and Special Education, 2013
Researchers have expressed concern about "implementation" fidelity in intervention research but have not extended that concern to "assessment" fidelity, or the extent to which pre-/posttests are administered and interpreted as intended. When studying reading interventions, data gathering heavily influences the identification of…
Descriptors: Reading Tests, Fidelity, Pretests Posttests, Intervention
Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014
There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…
Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)
Collier, Michael – Assessment and Evaluation in Higher Education, 1986 (peer reviewed)
A study revealing wide variation in the grading of electronics engineering test items by different evaluators has implications for evaluator and test item selection, analysis and manipulation of grades, and the use of numerical methods of assessment. (MSE)
Descriptors: Electronics, Engineering Education, Evaluation Methods, Evaluators
Weigle, Sara Cushing – 1994
This paper describes a study on rater training that involved the analysis of ratings given to English-as-a-Second-Language (ESL) compositions by 8 inexperienced and 8 experienced raters both before and after rater training, using FACETS (Linacre, 1990, 1993), which provides measures of rater severity and consistency. The testing text was a…
Descriptors: English (Second Language), Essay Tests, Evaluation Criteria, Evaluators
Thompson, Irene – 1995
This report addresses the reliability of the American Council on the Teaching of Foreign Languages (ACTFL) Oral Proficiency Interview (OPI), not as a measure of speaking ability, but rather as practiced by testers trained by the ACTFL, such as by the Interagency Language Roundtable (ILR), in English as a Second Language (ESL), French, German,…
Descriptors: English (Second Language), Evaluators, French, German
Kenyon, Dorry; Stansfield, Charles W. – 1993
This paper examines whether individuals who train themselves to score a performance assessment will rate acceptably when compared to known standards. Research on the efficacy of rater self-training materials developed by the Center for Applied Linguistics for the Texas Oral Proficiency Test (TOPT) is examined. Rater self-materials are described…
Descriptors: Bilingual Education, Comparative Analysis, Evaluators, Individual Characteristics
Quellmalz, Edys S.; Burry, James – 1983
The Center for the Study of Evaluation's (CSE) expository and narrative rating scales have been developed to meet the need for instructionally relevant methods for assessing students' writing competence. Research indicates that large numbers of raters can be trained in the use of these scales and that, during training and independent rating, they…
Descriptors: Evaluation Criteria, Evaluators, Expository Writing, Holistic Evaluation
Kaiser, Paul D.; Brull, Harry – 1994
The design, administration, scoring, and results of the 1993 New York State Correctional Captain Examination are described. The examination was administered to 405 candidates. As in previous Sergeant and Lieutenant examinations, candidates also completed latent image written simulation problems and open/closed book multiple choice test components.…
Descriptors: Competitive Selection, Correctional Rehabilitation, Decision Making, Educational Innovation

