NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 46 to 60 of 2,533 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018
In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Lin, Chih-Kai; Zhang, Jinming – Journal of Educational Measurement, 2018
Under the generalizability-theory (G-theory) framework, the estimation precision of variance components (VCs) is of significant importance in that they serve as the foundation of estimating reliability. Zhang and Lin advanced the discussion of nonadditivity in data from a theoretical perspective and showed the adverse effects of nonadditivity on…
Descriptors: Generalizability Theory, Reliability, Computation, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Kladouchou, Vasiliki; Papathanasiou, Ilias; Efstratiadou, Eva A.; Christaki, Vasiliki; Hilari, Katerina – International Journal of Language & Communication Disorders, 2017
Background & Aims: This study ran within the framework of the Thales Aphasia Project that investigated the efficacy of elaborated semantic feature analysis (ESFA). We evaluated the treatment integrity (TI) of ESFA, i.e., the degree to which therapists implemented treatment as intended by the treatment protocol, in two different formats:…
Descriptors: Aphasia, Semantics, Speech Therapy, Group Therapy
Peer reviewed Peer reviewed
Direct linkDirect link
Britton, Emily; Simper, Natalie; Leger, Andrew; Stephenson, Jenn – Assessment & Evaluation in Higher Education, 2017
Effective teamwork skills are essential for success in an increasingly team-based workplace. However, research suggests that there is often confusion concerning how teamwork is measured and assessed, making it difficult to develop these skills in undergraduate curricula. The goal of the present study was to develop a sustainable tool for assessing…
Descriptors: Teamwork, Undergraduate Students, Skills, Student Evaluation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Huebner, Alan; Lucht, Marissa – Practical Assessment, Research & Evaluation, 2019
Generalizability theory is a modern, powerful, and broad framework used to assess the reliability, or dependability, of measurements. While there exist classic works that explain the basic concepts and mathematical foundations of the method, there is currently a lack of resources addressing computational resources for those researchers wishing to…
Descriptors: Generalizability Theory, Test Reliability, Computer Software, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Flake, Jessica Kay; Petway, Kevin Terrance, II – Educational Measurement: Issues and Practice, 2019
Numerous studies merely note divergence in students' and teachers' ratings of student noncognitive constructs. However, given the increased attention and use of these constructs in educational research and practice, an in-depth study focused on this issue was needed. Using a variety of quantitative methodologies, we thoroughly investigate…
Descriptors: Teachers, Students, Achievement Rating, Interrater Reliability
Benton, Tom; Leech, Tony; Hughes, Sarah – Cambridge Assessment, 2020
In the context of examinations, the phrase "maintaining standards" usually refers to any activity designed to ensure that it is no easier (or harder) to achieve a given grade in one year than in another. Specifically, it tends to mean activities associated with setting examination grade boundaries. Benton et al (2020) describes a method…
Descriptors: Mathematics Tests, Equated Scores, Comparative Analysis, Difficulty Level
Peer reviewed Peer reviewed
Direct linkDirect link
Park, HwaChoon; Hill, Roger B. – Career and Technical Education Research, 2018
The Employability Skills Assessment (ESA) was translated into Korean (KESA) and the construct validity and reliabilities of the KESA were examined to provide Koreans with a scientific research-based work ethic measure. A total of 896 Korean Baby Boomers (1955-1963), Generation X (1964-1981) and Millennials (1982-1999) provided data. Work ethic was…
Descriptors: Foreign Countries, Employment Qualifications, Job Skills, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Rusticus, Shayna A.; Wilson, Derek; Jarus, Tal; O'Flynn-Magee, Kathy; Albon, Simon – Learning Environments Research, 2022
The desire to support student learning and professional development, in combination with accreditation requirements, necessitates the need to evaluate the learning environment of educational programs. The Health Education Learning Environment Survey (HELES) is a recently-developed global measure of the learning environment for health professions…
Descriptors: Medical Students, Student Attitudes, Allied Health Occupations Education, Educational Environment
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A.; Patil, Yogendra J. – Educational and Psychological Measurement, 2018
Recent research has explored the use of models adapted from Mokken scale analysis as a nonparametric approach to evaluating rating quality in educational performance assessments. A potential limiting factor to the widespread use of these techniques is the requirement for complete data, as practical constraints in operational assessment systems…
Descriptors: Scaling, Data, Interrater Reliability, Writing Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Goldammer, Philippe; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018
A readily applicable procedure is discussed that allows evaluation of the discrepancy between the popular coefficient alpha and the reliability coefficient of a scale with second-order factorial structure that is frequently of relevance in empirical educational and psychological research. The approach is developed within the framework of the…
Descriptors: Test Reliability, Factor Structure, Statistical Analysis, Computation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Garcia-Garzon, Eduardo; Abad, Francisco J.; Garrido, Luis E. – Journal of Intelligence, 2019
There has been increased interest in assessing the quality and usefulness of short versions of the Raven's Progressive Matrices. A recent proposal, composed of the last twelve matrices of the Standard Progressive Matrices (SPM-LS), has been depicted as a valid measure of "g." Nonetheless, the results provided in the initial validation…
Descriptors: Intelligence Tests, Test Validity, Evaluation Methods, Undergraduate Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Welch, Adam C.; Karpen, Samuel C.; Cross, L. Brian; LeBlanc, Brandie N. – Research & Practice in Assessment, 2017
The aims of this study were to determine faculty's ability to accurately and reliably categorize exam questions using Bloom's Taxonomy, and if modified versions would improve the accuracy and reliability. Faculty experience and affiliation with a health sciences discipline were also considered. Faculty at one university were asked to categorize 30…
Descriptors: College Faculty, Medical School Faculty, Health Sciences, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Saux, Gaston; Ros, Christine; Britt, M. Anne; Stadtler, Marc; Burin, Debora I.; Rouet, Jean-François – Discourse Processes: A Multidisciplinary Journal, 2018
In two experiments, undergraduate students read short texts containing two embedded sources that could either agree or disagree with each other. Participants' memory for the sources' identity (i.e., occupation) and features (i.e., the source's access to knowledge and the source's physical appearance) was examined as a function of the consistency…
Descriptors: Recall (Psychology), Reading, Undergraduate Students, Information Sources
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  169