Publication Date
| In 2026 | 0 |
| Since 2025 | 7 |
| Since 2022 (last 5 years) | 31 |
| Since 2017 (last 10 years) | 85 |
Descriptor
| Test Reliability | 85 |
| Test Validity | 49 |
| Data Collection | 33 |
| Data Analysis | 24 |
| Test Construction | 22 |
| Foreign Countries | 21 |
| Scores | 16 |
| Item Response Theory | 14 |
| Research Methodology | 13 |
| Data | 12 |
| Data Use | 12 |
| More ▼ | |
Source
Author
| Blaker, Lisa | 2 |
| Cutumisu, Maria | 2 |
| Erica S. Lembke | 2 |
| Kristen L. McMaster | 2 |
| Lê, Thanh | 2 |
| Manjary Guha | 2 |
| Najarian, Michelle | 2 |
| Nord, Christine | 2 |
| Seohyeon Choi | 2 |
| Tourangeau, Karen | 2 |
| Vaden-Kiernan, Nancy | 2 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 4 |
| Practitioners | 2 |
Location
| Turkey | 4 |
| Chile | 2 |
| New York | 2 |
| Spain | 2 |
| Sweden | 2 |
| Austria | 1 |
| California | 1 |
| Ecuador | 1 |
| Finland | 1 |
| Germany | 1 |
| Hungary | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Duncan Culbreth; Rebekah Davis; Cigdem Meral; Florence Martin; Weichao Wang; Sejal Foxx – TechTrends: Linking Research and Practice to Improve Learning, 2025
Monitoring applications (MAs) use digital and online tools to collect and track data on student behavior, and they have become increasingly popular among schools. Empirical research on these complex surveillance platforms is scant, and little is known about the efficacy or impact that they have on students. This study used a multi-method…
Descriptors: High School Students, COVID-19, Pandemics, Progress Monitoring
Philip E. Kearney; Niamh Curran; Frank J. Nugent – Journal of Motor Learning and Development, 2025
Manipulation checks are an essential component of quality experimental design in motor learning. Guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses framework, this methodological systematic review examined the utilization of manipulation checks in focus of attention research. Seventy-eight protocols from four…
Descriptors: Attention Control, Attention Span, Motor Development, Psychomotor Skills
Gerald Gartlehner; Leila Kahwati; Rainer Hilscher; Ian Thomas; Shannon Kugley; Karen Crotty; Meera Viswanathan; Barbara Nussbaumer-Streit; Graham Booth; Nathaniel Erskine; Amanda Konet; Robert Chew – Research Synthesis Methods, 2024
Data extraction is a crucial, yet labor-intensive and error-prone part of evidence synthesis. To date, efforts to harness machine learning for enhancing efficiency of the data extraction process have fallen short of achieving sufficient accuracy and usability. With the release of large language models (LLMs), new possibilities have emerged to…
Descriptors: Data Collection, Evidence, Synthesis, Language Processing
Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024
In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…
Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Catalina Patricia Morales-Murillo; Manuel Pacheco-Molero; Irene León-Estrada; Rosa Fernández-Valero; Mónica Gutiérrez Ortega; R. A. McWilliam – Early Childhood Education Journal, 2025
This study analyzed the fit of data collected with the Measure of Engagement, Independence, and Social Relationships for 3- to 5-year-olds (MEISR 3-to-5-years-old) to a proposed theoretical model based on the cross-walk of MEISR 3-to-5-years-old items and codes from 7 chapters of the Activities and Participation component of the International…
Descriptors: Foreign Countries, Early Childhood Education, Intervention, Preschool Children
Öz, Serap; Özdemir, Ali – International Journal of Contemporary Educational Research, 2022
The purpose of this study is to develop a valid and reliable Likert-type scale that can be used to measure the data literacy skills of educators. In the development process of the scale, after reviewing the relevant literature, a pool of 130 items was designed and presented to the experts for their view. After the evaluation of experts, the…
Descriptors: Likert Scales, Test Construction, Construct Validity, Test Reliability
Foster, Robert C. – Educational and Psychological Measurement, 2021
This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…
Descriptors: Test Reliability, Data, Computation, Mathematical Formulas
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024
There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…
Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education
Lee, Yi-Hsuan; Haberman, Shelby J. – Journal of Educational Measurement, 2021
For assessments that use different forms in different administrations, equating methods are applied to ensure comparability of scores over time. Ideally, a score scale is well maintained throughout the life of a testing program. In reality, instability of a score scale can result from a variety of causes, some are expected while others may be…
Descriptors: Scores, Regression (Statistics), Demography, Data
Benjawan Plengkham; Sonthaya Rattanasak; Patsawut Sukserm – Journal of Education and Learning, 2025
This academic article provides the essential steps for designing an effective English questionnaire in social science research, with a focus on ensuring clarity, cultural sensitivity and ethical integrity. Developed from key insights from related studies, it outlines potential practice in questionnaire design, item development and the importance…
Descriptors: Guidelines, Test Construction, Questionnaires, Surveys
Huebner, Alan; Lucht, Marissa – Practical Assessment, Research & Evaluation, 2019
Generalizability theory is a modern, powerful, and broad framework used to assess the reliability, or dependability, of measurements. While there exist classic works that explain the basic concepts and mathematical foundations of the method, there is currently a lack of resources addressing computational resources for those researchers wishing to…
Descriptors: Generalizability Theory, Test Reliability, Computer Software, Statistical Analysis

Peer reviewed
Direct link
