Showing 1,081 to 1,095 of 1,943 results
Clark, John L. D. – 1986
A study of the reliability of the proficiency ratings scale and techniques used by three federal government agencies--the Central Intelligence Agency, the Defense Language Institute, and the Foreign Service Institute (FSI)--to test employees' oral language proficiency in French and German had two randomly selected two-person teams of testers from…
Descriptors: Comparative Analysis, Federal Government, French, German
Peer reviewed
Magnan, Sally Sieloff – Canadian Modern Language Review, 1987
Differences in procedures used by academic institutions and government agencies in administering the American Council on the Teaching of Foreign Languages' Oral Proficiency Interview test are examined, and results and implications of two studies of interrater reliability are discussed. (MSE)
Descriptors: Comparative Analysis, Correlation, Evaluation Methods, Evaluators
Eason, Sandra H. – 1989
Generalizability theory provides a technique for accurately estimating the reliability of measurements. The power of this theory is based on the simultaneous analysis of multiple sources of error variances. Equally important, generalizability theory considers relationships among the sources of measurement error. Just as multivariate inferential…
Descriptors: Comparative Analysis, Generalizability Theory, Test Reliability, Test Theory
Peer reviewed
Hofmann, Richard J. – Educational and Psychological Measurement, 1978
The Goodenough technique for determining scale error is compared with the Guttman technique and shown to be the more conservative of the two. Implications for Guttman's rule of thumb for evaluating reproducibility are noted. (Author)
Descriptors: Comparative Analysis, Rating Scales, Statistical Analysis, Test Reliability
Zuravin, Susan J.; And Others – Child Abuse and Neglect: The International Journal, 1987
Anonymous reports (n=155) of child physical abuse in Baltimore (MD) were compared with reports made by professionals (n=588) and nonprofessionals (n=262) in terms of substantiation rate, seriousness of substantiated incidents, and severity of allegations. While anonymous reports were more likely to be unfounded, those that were substantiated were…
Descriptors: Child Abuse, Comparative Analysis, Professional Personnel, Reliability
Lefkowitz, David M. – J Counseling Psychol, 1970
Comparisons made in errors of classifications, percentage of overlapping, correlation of identical scales scored by both scoring procedures, intercorrelations of scales, and the ranking of scale scores within each subject showed that the two scoring systems produced different interest scores. (Author)
Descriptors: Comparative Analysis, Evaluation, Interest Inventories, Reliability
Peer reviewed
Green, Samuel B. – Educational and Psychological Measurement, 1981
The proportion of agreement, G, and kappa indexes are shown to differ in how they correct for chance agreements between two observers. On the basis of the findings, it is suggested that no single agreement index is appropriate for all sets of data. (Author/BW)
Descriptors: Comparative Analysis, Measurement Techniques, Test Reliability, Testing Problems
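Note: the three indexes named in the abstract above differ only in the chance-agreement term they subtract. The sketch below is an illustration, not taken from the article; it assumes that "G" refers to the Holley-Guilford G index (chance agreement fixed at 1/k for k categories) and that "kappa" is Cohen's kappa (chance agreement taken from each observer's marginal proportions). The function name and the sample ratings are hypothetical.

```python
from collections import Counter


def agreement_indexes(rater_a, rater_b):
    """Return (proportion of agreement, G index, Cohen's kappa) for two observers.

    Assumption: G is the Holley-Guilford index, which treats all k observed
    categories as equally likely by chance; kappa is Cohen's kappa.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")

    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)
    k = len(categories)
    if k < 2:
        raise ValueError("chance correction needs at least two observed categories")

    # Observed proportion of agreement: no correction for chance.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # G index: chance agreement is assumed to be 1/k regardless of marginals.
    g_chance = 1 / k
    g = (p_o - g_chance) / (1 - g_chance)

    # Cohen's kappa: chance agreement is the sum over categories of the
    # product of the two observers' marginal proportions.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    p_e = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    kappa = (p_o - p_e) / (1 - p_e)

    return p_o, g, kappa


# Hypothetical ratings with skewed marginals (8 of 10 cases rated 1 by each observer).
ratings_a = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
ratings_b = [1, 1, 1, 1, 1, 1, 1, 0, 1, 0]
p_o, g, kappa = agreement_indexes(ratings_a, ratings_b)
print(round(p_o, 3), round(g, 3), round(kappa, 3))
```

For these made-up ratings the observed agreement is 0.8, G is 0.6, and kappa is 0.375: with skewed marginals, kappa's chance term (0.68) is larger than G's (0.5), so the same data yield noticeably different corrected values.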
Peer reviewed
Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1994
An approximate statistical test of the equality of two intraclass reliability coefficients based on the same sample of people is derived. Such a test is needed when a researcher wishes to compare the reliability of two measurement procedures, and both procedures can be applied to results from the same group. (SLD)
Descriptors: Comparative Analysis, Measurement Techniques, Reliability, Sampling
Peer reviewed
Ketelaars, Mieke P.; Cuperus, Juliane M.; van Daal, John; Jansonius, Kino; Verhoeven, Ludo – Research in Developmental Disabilities: A Multidisciplinary Journal, 2009
The present study examines the validity of the Dutch Children's Communication Checklist (CCC) for children in kindergarten in a community sample, in order to assess the feasibility of using it as a screening instrument in the general population. Teachers completed the CCC for a representative sample of 1396 children at kindergarten level, taken…
Descriptors: Check Lists, Emotional Problems, Language Impairments, Construct Validity
McNamara, T. F.; Adams, R. J. – 1991
A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…
Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability
Peer reviewed
Rogers, W. Todd; Harley, Dwight – Educational and Psychological Measurement, 1999
Examined item-level and test-level characteristics for items in a high-stakes school-leaving mathematics examination. Results from 158 students show that the influence of testwiseness is lessened when three-option items are used. Tests of three-option items are at least equivalent to four-option item tests in terms of internal-consistency score…
Descriptors: Comparative Analysis, High School Students, High Schools, High Stakes Tests
Peer reviewed
Darby, Lynn A.; Marsh, Jennifer L.; Shewokis, Patricia A.; Pohlman, Roberta L. – Measurement in Physical Education and Exercise Science, 2007
To adhere to the principle of "exercise specificity," exercise testing should be completed using the same physical activity that is performed during exercise training. The present study was designed to assess whether aerobic step exercisers have a greater maximal oxygen consumption (max VO₂) when tested using an activity-specific, maximal step…
Descriptors: Metabolism, Physical Activities, Exercise Physiology, Females
Peer reviewed
Vogler, Kenneth E. – Educational Assessment, 2008
This study compared the impact of state accountability examinations on social studies teachers' instructional practices. Data were obtained from a survey instrument given to a representative sample of Mississippi teachers who teach the same content tested on their state's high-stakes high school graduation examination and a representative sample…
Descriptors: Academic Achievement, High Stakes Tests, Accountability, Social Studies
Peer reviewed
Slade, Peter D.; Townes, Brenda D.; Rosenbaum, Gail; Martins, Isabel P.; Luis, Henrique; Bernardo, Mario; Martin, Michael D.; DeRouen, Timothy A. – Psychological Assessment, 2008
When serial neurocognitive assessments are performed, 2 main factors are of importance: test-retest reliability and practice effects. With children, however, there is a third, developmental factor, which occurs as a result of maturation. Child tests recognize this factor through the provision of age-corrected scaled scores. Thus, a ready-made…
Descriptors: Validity, Diagnostic Tests, Test Reliability, Children
Peer reviewed
Ritvo, Riva Ariella; Ritvo, Edward R.; Guthrie, Donald; Yuwiler, Arthur; Ritvo, Max Joseph; Weisbender, Leo – Journal of Autism and Developmental Disorders, 2008
An empirically based 78-question self-rating scale based on DSM-IV-TR and ICD-10 criteria, the Ritvo Autism and Asperger's Diagnostic Scale (RAADS), was developed to assist clinicians' diagnosis of adults with autism and Asperger's Disorder. It was standardized on 17 autistic, 20 Asperger's Disorder, and 57 comparison subjects. Both autistic and…
Descriptors: Autism, Asperger Syndrome, Content Validity, Test Reliability