NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 2,836 to 2,850 of 3,122 results Save | Export
Mitchell, Karen J.; Anderson, Judith A. – 1987
The Association of American Medical Colleges is conducting research to develop, implement, and evaluate a Medical College Admission Test (MCAT) essay testing program. Essay administration in the spring and fall of 1985 and 1986 suggested that additional research was needed on the development of topics which elicit similar skills and meet standard…
Descriptors: College Entrance Examinations, Essay Tests, Estimation (Mathematics), Generalizability Theory
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Peterson, Gary W. – 1983
Even though several national testing firms have developed measures to evaluate the effectiveness of baccalaureate education, there continues to be a general reluctance on the part of faculty in colleges and universities to accept these measures as criteria on which to evaluate educational programs. Some of the resistance appears to lie in the lack…
Descriptors: Bachelors Degrees, Cognitive Processes, Difficulty Level, Essay Tests
Walker, Richard N. – 1989
In an assessment of the adequacy of the Gesell screening examination as a test instrument, a Gesell Screening Evaluation was given to 400 children semi-annually from their 4th to 6th year. The sample, which was stratified by parent occupation, included 40 girls and 40 boys at 5 age levels. The test battery corresponded with the Gesell Preschool…
Descriptors: Chronological Age, Early Childhood Education, Followup Studies, Interrater Reliability
Ferrara, Steven F. – 1987
The necessity of controlling the order in which trained essay raters for a statewide writing assessment program receive student essays was studied. The underlying theoretical question concerns possible rater bias caused by raters reading long strings of essays of homogeneous quality; this problem is usually referred to as context effect or…
Descriptors: Context Effect, Essay Tests, Evaluators, Graduation Requirements
Yap, Kueh Chin; Capie, William – 1985
The purpose of this study was to compare the relative magnitude of the variance components and generalizability coefficients derived from the Teacher Performance Assessment Instruments (TPAI) data using two different methods of data collection: (1) occasions when observers were in the classroom for simultaneous observation and (2) occasions when…
Descriptors: Analysis of Variance, Classroom Observation Techniques, Data Collection, Elementary Secondary Education
Breland, Hunter M.; And Others – 1987
Six university English departments collaborated in this examination of the differences between multiple-choice and essay tests in evaluating writing skills. The study also investigated ways the two tools can complement one another, ways to improve cost effectiveness of essay testing, and ways to integrate assessment and the educational process.…
Descriptors: Comparative Testing, Efficiency, Essay Tests, Higher Education
Lange, Dale L.; Lowe, Pardee, Jr. – 1987
A study investigated the use of reading proficiency scales developed by the American Council on the Teaching of Foreign Languages (ACTFL), Educational Testing Service (ETS), and Interagency Language Roundtable (ILR) for meaningful rank-ordering and assigning levels of second language competence to reading passages. In a proficiency test writing…
Descriptors: College Entrance Examinations, Difficulty Level, Higher Education, Interrater Reliability
Dielman, T. E.; Horvatich, Paula K. – 1985
The purposes of this study were to establish the interrater reliability, dimensionality, and internal consistency of an instruction evaluation instrument used at The University of Michigan Medical School. Using the nine-item rating scale, 1,758 student ratings and 88 staff ratings were gathered on 61 faculty. Interrater agreement ranged from .28…
Descriptors: Evaluation Methods, Graduate Medical Education, Higher Education, Interrater Reliability
Busch, John Christian; Jaeger, Richard M. – 1984
This study addressed seven questions regarding the methods used in setting passing scores on the essay subtest of the National Teacher Examinations (NTE) Communication Skills test for the North Carolina State Board of Education. North Carolina uses these tests to screen prospective applicants to teacher education programs. The judges (five college…
Descriptors: College Entrance Examinations, Criterion Referenced Tests, Cutting Scores, Essay Tests
van der Linden, Wim J. – 1982
A latent trait method is presented to investigate the possibility that Angoff or Nedelsky judges specify inconsistent probabilities in standard setting techniques for objectives-based instructional programs. It is suggested that judges frequently specify a low probability of success for an easy item but a large probability for a hard item. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Interrater Reliability
Peer reviewed Peer reviewed
Ronka, Carol S.; Barnett, David – Special Services in the Schools, 1986
A study examined teachers' and parents' ratings of 39 developmentally handicapped (DH) and 9 learning disabled children on the revised Vineland and the AAMD Adaptive Behavior Scale. Teachers rated children eligible for DH placement more frequently than parents. The adaptive behavior ratings of the two instruments differed significantly. (Author/CB)
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Developmental Disabilities, Disability Identification
Peer reviewed Peer reviewed
Mayfield, Kathy L.; And Others – Journal of School Psychology, 1984
Investigated interrater reliability of the AAMD Adaptive Behavior Scale-Public School Version in a sample of 31 educable mentally handicapped children who were rated by their parents, special education teacher, classroom teacher, and an independent observer. Results showed ratings of the special education teacher were generally lower. (JAC)
Descriptors: Adjustment (to Environment), Behavior Rating Scales, Children, Elementary Education
PDF pending restoration PDF pending restoration
Hori, Utako; Ito, Tokumi; Kitazawa, Mieko; Masuda, Masako; Ogiwara, Chikako; Saito, Mariko; Yoneda, Yukiyo – 1996
A group of seven Japanese-language Oral Proficiency Interview (OPI) testers licensed by the American Council on the Teaching of foreign Languages (ACTFL) conducted research related to ACTFL-OPI criteria. They first examined 24 audiotaped interview tests to see what kind of consistency there would be when individual testers applied general criteria…
Descriptors: Audiotape Recordings, Comparative Analysis, Foreign Countries, Grammar
McGinty, Dixie; Neel, John H.; Hsu, Yu-Sheng – 1996
The cognitive components standard setting method, recently introduced by D. McGinty and J. Neel (1996), asks judges to specify minimum levels of performance not for the test items, but for smaller portions of items, the component skills and concepts required to answer each item correctly. Items are decomposed into these components before judges…
Descriptors: Cognitive Processes, Criterion Referenced Tests, Elementary Education, Evaluation Methods
Pages: 1  |  ...  |  186  |  187  |  188  |  189  |  190  |  191  |  192  |  193  |  194  |  ...  |  209