Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Zijlmans, Eva A. O.; Tijmstra, Jesper; van der Ark, L. Andries; Sijtsma, Klaas – Educational and Psychological Measurement, 2018
Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method [lambda][subscript 6], and method CA. The item-score…
Descriptors: Test Items, Test Reliability, Correlation, Comparative Analysis
Raykov, Tenko; Goldammer, Philippe; Marcoulides, George A.; Li, Tatyana; Menold, Natalja – Educational and Psychological Measurement, 2018
A readily applicable procedure is discussed that allows evaluation of the discrepancy between the popular coefficient alpha and the reliability coefficient of a scale with second-order factorial structure that is frequently of relevance in empirical educational and psychological research. The approach is developed within the framework of the…
Descriptors: Test Reliability, Factor Structure, Statistical Analysis, Computation
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Hua, Yi – Canadian Journal of School Psychology, 2018
This article describes and reviews the "Test of Early Communication and Emerging Language" (TECEL; Huer & Miller, 2011). The test was constructed to assess infants' and toddlers' earliest communication and language abilities. The TECEL is a revision of the Nonspeech Test for Receptive/ Expressive Language (NST; Huer, 1983, 1988). The…
Descriptors: Language Acquisition, Correlation, Content Validity, Infants
Folkestad, James E.; McKernan, Brian; Train, Stephanie; Martey, Rosa Mikeal; Rhodes, Matthew G.; Kenski, Kate; Shaw, Adrienne; Stromer-Galley, Jennifer; Clegg, Benjamin A.; Strzalkowski, Tomek – Technology, Knowledge and Learning, 2018
The engaging nature of video games has intrigued learning professionals attempting to capture and retain learners' attention. Designing learning interventions that not only capture the learner's attention, but also are designed around the natural cycle of attention will be vital for learning. This paper introduces the temporal attentive…
Descriptors: Attention, Measures (Individuals), Video Games, Learner Engagement
Rae, Guenevere; Newman, William P., III; McGoey, Robin; Donthamsetty, Supriya; Karpinski, Aryn C.; Green, Jeffrey – Anatomical Sciences Education, 2018
The purpose of this study was to examine the histopathologic reliability of embalmed cadaveric tissue taken from the gross anatomy laboratory. Tissue samples from hearts, livers, lungs, and kidneys were collected after the medical students' dissection course was completed. All of the cadavers were embalmed in a formalin-based fixative solution.…
Descriptors: Anatomy, Physiology, Pathology, Reliability
Taber, Keith S. – Research in Science Education, 2018
Cronbach's alpha is a statistic commonly quoted by authors to demonstrate that tests and scales that have been constructed or adopted for research projects are fit for purpose. Cronbach's alpha is regularly adopted in studies in science education: it was referred to in 69 different papers published in 4 leading science education journals in a…
Descriptors: Educational Research, Science Education, Statistics, Measures (Individuals)
Fan, Chung-Hau; Zhang, Yanchen; Cook, Clayton R.; Yang, Nai-Jiin – Journal of Applied School Psychology, 2018
Response to intervention (RtI) models have increasingly been adopted to improve outcomes for all students through the delivery of a continuum of supports and making timely responsive instructional decisions based on data. With this increasing popularity, researchers and practitioners have developed several RtI-related assessments, many of which…
Descriptors: Factor Structure, Surveys, Response to Intervention, Readiness
Yocarini, Iris E.; Bouwmeester, Samantha; Smeets, Guus; Arends, Lidia R. – Educational Measurement: Issues and Practice, 2018
This real-data-guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance,…
Descriptors: Comparative Analysis, Decision Making, Accuracy, Higher Education
Traynor, A.; Merzdorf, H. E. – Educational Measurement: Issues and Practice, 2018
During the development of large-scale curricular achievement tests, recruited panels of independent subject-matter experts use systematic judgmental methods--often collectively labeled "alignment" methods--to rate the correspondence between a given test's items and the objective statements in a particular curricular standards document.…
Descriptors: Achievement Tests, Expertise, Alignment (Education), Test Items
Dijkhuizen, Annemarie; Douma, Rob K.; Krijnen, Wim P.; van der Schans, Cees P.; Waninge, Aly – Journal of Applied Research in Intellectual Disabilities, 2018
Background: A feasible and reliable instrument to measure strength in persons with severe intellectual and visual disabilities (SIVD) is lacking. The aim of our study was to determine feasibility, learning period and reliability of three strength tests. Methods: Twenty-nine participants with SIVD performed the Minimum Sit-to-Stand Height test…
Descriptors: Intellectual Disability, Visual Impairments, Muscular Strength, Reliability
Kershree Padayachee; M. Matimolane – Teaching in Higher Education, 2025
In the shift to Emergency Remote Teaching and Learning (ERT&L) during the COVID-19 pandemic, remote assessment and feedback became a major source of discontent and challenge for students and staff. This paper is a reflection and analysis of assessment practices during ERT&L, and our theorisation of the possibilities for shifts towards…
Descriptors: Educational Quality, Social Justice, Distance Education, Feedback (Response)
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Amir Reza Rahimi; Zahra Mosalli – Smart Learning Environments, 2025
Researchers have significantly explored language learners' attitudes toward ChatGPT through the lens of technology acceptance models, particularly with its development and integration into computer-assisted language learning (CALL). However, further research in this area is necessary to apply a theoretical framework with a pedagogical-oriented…
Descriptors: Second Language Learning, Second Language Instruction, Artificial Intelligence, Technology Uses in Education
Karolina Urton; Mariola Moeyaert; Kerstin Nobel; Anne Barwasser; Richard T. Boon; Matthias Grünke – Exceptionality, 2025
This study employed a meta-analytic approach to examine the effectiveness of graphic organizers (GOs) in improving academic and behavioral outcomes for K-12 students with disabilities, drawing from the single-case special education literature. Moderators at participant and study level were analyzed in addition to the main effects. A comprehensive…
Descriptors: Instructional Materials, Students with Disabilities, Instructional Effectiveness, Academic Achievement

Peer reviewed
Direct link
