Publication Date
| In 2026 | 0 |
| Since 2025 | 56 |
| Since 2022 (last 5 years) | 282 |
| Since 2017 (last 10 years) | 778 |
| Since 2007 (last 20 years) | 2040 |
Descriptor
| Interrater Reliability | 3122 |
| Foreign Countries | 654 |
| Test Reliability | 503 |
| Evaluation Methods | 502 |
| Test Validity | 410 |
| Correlation | 401 |
| Scoring | 347 |
| Comparative Analysis | 327 |
| Scores | 324 |
| Validity | 310 |
| Student Evaluation | 308 |
Audience
| Researchers | 130 |
| Practitioners | 42 |
| Teachers | 22 |
| Administrators | 11 |
| Counselors | 3 |
| Policymakers | 2 |
Location
| Australia | 56 |
| Turkey | 53 |
| United Kingdom | 46 |
| Canada | 45 |
| Netherlands | 40 |
| China | 38 |
| California | 37 |
| United States | 30 |
| United Kingdom (England) | 24 |
| Taiwan | 23 |
| Germany | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 3 |
Eaton, Sarah Elaine; Crossman, Katherine – Interchange: A Quarterly Review of Education, 2018
Self-plagiarism is a contentious issue in higher education, research and scholarly publishing contexts. The practice is problematic because it disrupts scientific publishing by over-emphasizing results, increasing journal publication costs, and artificially inflating journal impact, among other consequences. We hypothesized that there was a dearth…
Descriptors: Plagiarism, Databases, Social Sciences, Journal Articles
Rideout, Blaire Moody – College and University, 2018
This study examined a holistic admissions review process at one institution to determine whether variance occurred and, if so, possible explanations for it. The study analyzed reader reviews for approximately 15,000 individual undergraduate admission reviews over three years. The primary method was focused on the variance and inter-rater…
Descriptors: Interrater Reliability, Holistic Approach, College Applicants, Evaluation Methods
Bosch, Nigel; Paquette, Luc – Journal of Learning Analytics, 2018
Metrics including Cohen's kappa, precision, recall, and F₁ are common measures of performance for models of discrete student states, such as a student's affect or behaviour. This study examined discrete model metrics for previously published student model examples to identify situations where metrics provided differing perspectives on…
Descriptors: Models, Comparative Analysis, Prediction, Probability
Bronkhorst, Hugo; Roorda, Gerrit; Suhre, Cor; Goedhart, Martin – Research in Mathematics Education, 2022
Logical reasoning as part of critical thinking is becoming more and more important to prepare students for their future life in society, work, and study. This article presents the results of a quasi-experimental study with a pre-test-post-test control group design focusing on the effective use of formalisations to support logical reasoning. The…
Descriptors: Mathematics Instruction, Teaching Methods, Logical Thinking, Critical Thinking
Canning, Derek N.; McLean, Stuart; Vitta, Joseph P. – Vocabulary Learning and Instruction, 2022
The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…
Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills
Wang, Peiyu; Coetzee, Karen; Strachan, Andrea; Monteiro, Sandra; Cheng, Liying – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2020
Internationally educated nurses' (IENs) English language proficiency is critical to professional licensure as communication is a key competency for safe practice. The Canadian English Language Benchmark Assessment for Nurses (CELBAN) is Canada's only Canadian Language Benchmarks (CLB) referenced examination used in the context of healthcare…
Descriptors: Item Response Theory, Language Tests, English (Second Language), Nurses
Koriakin, Taylor A.; McKee, Sarah L.; Schwartz, Marlene B.; Chafouleas, Sandra M. – Journal of School Health, 2020
Background: Stakeholders increasingly recognize the role of policy in implementing Whole School, Whole Community, Whole Child (WSCC) frameworks in schools; however, few tools are currently available to assess alignment between district policies and WSCC concepts. The purpose of this study was to expand the Wellness School Assessment Tool (WellSAT)…
Descriptors: School Policy, Health Services, Health Promotion, Wellness
McQuade, Richard; Kometa, Simon; Brown, Jeremy; Bevitt, Debra; Hall, Judith – Assessment & Evaluation in Higher Education, 2020
Research project modules are a key part of UK undergraduate and postgraduate bioscience degree programmes. Report marking invariably uses two assessors, but marking models are mixed with some institutions using two independent markers and others using the project supervisor as one of the assessors. This latter model is controversial with critics…
Descriptors: Foreign Countries, Research Projects, Student Research, Supervisors
Wang, Lifeng; Khalaf, Ahmad Taha; Lei, Dongyu; Gale, Mengke; Li, Jing; Jiang, Ping; Du, Jing; Yinayeti, Xuehereti; Abudureheman, Mayinuer; Wei, Yuanyuan – Advances in Physiology Education, 2020
Traditional oral examination (TOE) is criticized for the shortage of objectivity, standardization, and reliability. These perceived limitations can be mitigated by the introduction of structured oral examination (SOE). There is little evidence of the implementation of SOE in physiology laboratory courses. The purpose of this study was to…
Descriptors: Verbal Tests, Evaluation Methods, Science Laboratories, Physiology
Rogers, Kimberly Cervello; Petrulis, Robert; Yee, Sean P.; Deshler, Jessica – International Journal of Research in Undergraduate Mathematics Education, 2020
This paper presents the development and validation of the 17-item mathematics Graduate Student Instructor Observation Protocol (GSIOP) at two universities. The development of this instrument attended to some unique needs of novice undergraduate mathematics instructors while building on an existing instrument that focused on classroom interactions…
Descriptors: Measures (Individuals), Observation, Test Construction, Test Validity
Vibert, Bethany A.; Dufek, Sarah; Klein, Claire B.; Choi, Yeo Bi; Winter, Jamie; Lord, Catherine; Kim, So Hyun – Journal of Autism and Developmental Disorders, 2020
This study aimed to provide initial validity and reliability of the "Measure of NDBI Strategy Implementation-Caregiver Change" ("MONSI-CC"), a novel measure that captures changes in caregivers' implementation of NDBI strategies during early intervention. The MONSI-CC was applied to 119 observations of 43 caregiver-child dyads…
Descriptors: Child Caregivers, Autism, Pervasive Developmental Disorders, Test Validity
Kavanagh, Jennifer; Moran, Kieran; Issartel, Johann – Physical Education and Sport Pedagogy, 2020
Background: Cycling has gained more attention as an important lifelong physical activity. Learning to cycle independently without assistance is a milestone for most children that requires time and practice to master. Cycling was recently added to the motor development model and so a valid and reliable measure of cycling ability is required to…
Descriptors: Test Construction, Test Reliability, Physical Activities, Motor Development
Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Learning Disabilities, 2021
In this study, we examined the relationship of special education teachers' performance on the Recognizing Effective Special Education Teachers (RESET) Explicit Instruction observation protocol with student growth on academic measures. Special education teachers provided video-recorded observations of three instructional lessons along with data…
Descriptors: Special Education Teachers, Teacher Effectiveness, Teacher Evaluation, Direct Instruction
Safak, Pinar; Cakmak, Salih; Karakoc, Tamer; Aydin O'Dwyer, Pinar – European Journal of Educational Research, 2021
This study aimed to develop a valid and reliable instrument that measures the functional vision of students with low vision. Thus, an assessment tool and performance activities were developed for three vision skill groups (near vision skills, distance vision skills, and visual field) that include functional vision skills. The universe was 1485…
Descriptors: Foreign Countries, Vision Tests, Diagnostic Tests, Vision
Gübes, Nese Öztürk – Participatory Educational Research, 2021
The aim of this study is to show how a many-facet Rasch measurement model (MFRM) can be used for quality control whilst monitoring a musical aptitude examination. The data used in this study were gathered from a musical aptitude examination which was applied in the 2019-2020 academic year for selecting teacher candidates to a music education department…
Descriptors: Foreign Countries, Music Education, Teacher Education Programs, Preservice Teacher Education

