Showing 2,236 to 2,250 of 3,124 results
Edinger, Jack D.; Vosk, Barbara N. – 1983
Of the many short forms of the Minnesota Multiphasic Personality Inventory (MMPI) that have been developed, the MMPI-168 is among the most promising. To determine whether clinical judgments based on the MMPI-168 are comparable to judgments based on the standard MMPI, 30 clinical psychologists participated in a randomized block, repeated treatment…
Descriptors: Comparative Testing, Diagnostic Tests, Interrater Reliability, Personality Measures
Thompson, Richard T.; Johnson, Dora E. – 1988
Efforts to expand the generic language proficiency guidelines of the American Council on the Teaching of Foreign Languages (ACTFL) to the less commonly taught languages (LCTLs) began when developers realized that the ACTFL guidelines were too Eurocentric; the guidelines included grammatical categories specific to Western European languages and…
Descriptors: Cultural Context, Interrater Reliability, Language Proficiency, Language Tests
Halpin, Gerald; And Others – 1986
Based upon the assumption that the process of peer review of publications and research is flawed, interrater reliability of reviews of 188 research proposals submitted for funding at a major university was studied. The eight dimensions rated were: (1) significance of the research; (2) clarity and reasonableness of the objectives; (3)…
Descriptors: College Faculty, Evaluation Criteria, Evaluators, Grants
Peer reviewed
Blok, H. – Journal of Educational Measurement, 1985
Raters judged essays on two occasions, making it possible to address the question of whether multiple ratings, however obtained, represent the same true scores. Multiple ratings of a given rater did represent the same true scores, but ratings of different raters did not. Reliability, validity, and invalidity coefficients were computed. (Author/DWH)
Descriptors: Analysis of Variance, Elementary Education, Essay Tests, Evaluators
Peer reviewed
Houston, Samuel R.; And Others – Journal of Experimental Education, 1987
Judgment analysis (JAN) and JAN-paired comparison (JAN-PC) were compared as a statistical policy-capturing strategy. The readability of 15 selected passages used in grades 1-6 was evaluated. More consistent results for each judge (five Arabic teachers of reading) and fewer policies were obtained with JAN-PC than with JAN. (TJH)
Descriptors: Arabic, Elementary School Teachers, Interrater Reliability, Judgment Analysis Technique
Peer reviewed
Powers, Stephen; And Others – Educational and Psychological Measurement, 1985
Results of an administration of the Language Proficiency Measure indicated that the interrater reliability was adequate, internal-consistency reliability estimates were high, concurrent validity coefficients were adequate, and the classification validity was acceptable. (Author/LMO)
Descriptors: Elementary Education, Interrater Reliability, Language Proficiency, Language Tests
Peer reviewed
Friedman, Lee; Harvey, Robert J. – Personnel Psychology, 1986
Job-naive raters provided with job descriptive information made Position Analysis Questionnaire (PAQ) ratings which were validated against ratings of job analysts who were also job content experts. None of the reduced job descriptive information conditions enabled job-naive raters to obtain either acceptable levels of convergent validity with…
Descriptors: College Students, Evaluation Methods, Evaluators, Higher Education
Peer reviewed
Weinrott, Mark R.; Jones, Richard R. – Child Development, 1984
Examines the tendency of observers to make less reliable recordings of behavioral events when a calibrating observer is absent. Using four different multicategory systems, 26 experienced observers coded 200 hours of videotaped family interactions. Concludes that observers lapse into a less attentive "set" prior to coding without a…
Descriptors: Adults, Behavior Patterns, Behavior Rating Scales, Family (Sociological Unit)
Peer reviewed
Hartman, Bruce W.; And Others – Journal of Experimental Education, 1986
The detrimental effects of nonresponse bias are particularly significant given the widespread use of the survey data collection method in educational surveys. Current methods for remediating nonresponse bias in educational surveys are explored and critiqued. (Author/LMO)
Descriptors: Data Collection, Educational Assessment, Elementary Secondary Education, Error of Measurement
Peer reviewed
Saudargas, Richard A.; Lentz, Frances E., Jr. – School Psychology Review, 1986
Using development of a State Event Observation System as an example, the decision rules and procedures for constructing standardized multiple behavior observational systems that provide accurate, reliable data for school-based assessment, intervention, and research are described. Reliability and validity data from the SECOS are provided.…
Descriptors: Classroom Observation Techniques, Elementary Education, Interrater Reliability, Measurement
Peer reviewed
Fuchs, Douglas; And Others – Journal of Educational Research, 1985
A study investigated whether examiners' personal familiarity and professional experience with examinees affect handicapped children's test performance. Professionally experienced and inexperienced examiners were used in test conditions that varied by degree of personal familiarity of examiners and children. Results are discussed. (Author/DF)
Descriptors: Disabilities, Examiners, Experimenter Characteristics, Interpersonal Relationship
Peer reviewed
Henk, William A.; Selders, Mary L. – Reading Teacher, 1984
Shows that synonymic scoring of cloze tests is highly variable; the score seems to depend simply on who grades the test. (FL)
Descriptors: Cloze Procedure, Interrater Reliability, Reading Instruction, Reading Research
Peer reviewed
Johnson, Brian W. – Educational and Psychological Measurement, 1983
Regression analyses indicated that the Coopersmith Self-Esteem Inventory has convergent validity with regard to the Piers-Harris Children's Self-Concept Scale and the Coopersmith Behavioral Academic Assessment Scale, has discriminant validity with regard to the Children's Social Desirability Scale, is sensitive to differences in achievement level,…
Descriptors: Academic Achievement, Intermediate Grades, Interrater Reliability, Self Concept Measures
Popp, Sharon E. Osborn; Ryan, Joseph M.; Thompson, Marilyn S.; Behrens, John T. – 2003
The purposes of this study were to investigate the role of benchmark writing samples in direct assessment of writing and to examine the consequences of differential benchmark selection with a common writing rubric. The influences of discourse and grade level were also examined within the context of differential benchmark selection. Raters scored…
Descriptors: Benchmarking, Elementary Education, Elementary School Students, Interrater Reliability
Peer reviewed
Floyd, Frank J.; Markman, Howard J. – Journal of Consulting and Clinical Psychology, 1983
Examined couples' and observers' perspectives of marital interaction. Nondistressed (N=10) and distressed (N=6) couples and objective observers (N=10) evaluated couples' interactional behaviors. Spouses' ratings of their partners' behavior were not consistent with observers' ratings of partners' behaviors but were consistent with observers'…
Descriptors: Cognitive Style, Evaluation Methods, Interaction, Interpersonal Relationship