Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 51 |
| Since 2007 (last 20 years) | 103 |
Descriptor
| Test Reliability | 237 |
| Test Validity | 129 |
| Data Analysis | 84 |
| Data Collection | 75 |
| Test Construction | 59 |
| Tables (Data) | 51 |
| Foreign Countries | 43 |
| Evaluation Methods | 42 |
| Scores | 32 |
| Statistical Data | 29 |
| Correlation | 28 |
Author
| DiLuzio, Geneva J. | 4 |
| Capie, William | 3 |
| Stallings, Jane A. | 3 |
| Algozzine, Bob | 2 |
| Algozzine, Kate | 2 |
| Birenbaum, Menucha | 2 |
| Cusumano, Dale | 2 |
| Erica S. Lembke | 2 |
| Hines, Constance V. | 2 |
| Ho, Andrew D. | 2 |
| Horner, Robert H. | 2 |
Audience
| Researchers | 14 |
| Practitioners | 2 |
| Counselors | 1 |
| Teachers | 1 |
Location
| Turkey | 8 |
| Australia | 5 |
| Canada | 4 |
| California | 3 |
| Spain | 3 |
| Belgium | 2 |
| Chile | 2 |
| Colorado | 2 |
| Florida | 2 |
| Illinois | 2 |
| Singapore | 2 |
Laws, Policies, & Programs
| Bilingual Education Act 1968 | 1 |
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
Duncan Culbreth; Rebekah Davis; Cigdem Meral; Florence Martin; Weichao Wang; Sejal Foxx – TechTrends: Linking Research and Practice to Improve Learning, 2025
Monitoring applications (MAs) use digital and online tools to collect and track data on student behavior, and they have become increasingly popular among schools. Empirical research on these complex surveillance platforms is scant, and little is known about the efficacy or impact that they have on students. This study used a multi-method…
Descriptors: High School Students, COVID-19, Pandemics, Progress Monitoring
Gerald Gartlehner; Leila Kahwati; Rainer Hilscher; Ian Thomas; Shannon Kugley; Karen Crotty; Meera Viswanathan; Barbara Nussbaumer-Streit; Graham Booth; Nathaniel Erskine; Amanda Konet; Robert Chew – Research Synthesis Methods, 2024
Data extraction is a crucial, yet labor-intensive and error-prone part of evidence synthesis. To date, efforts to harness machine learning for enhancing efficiency of the data extraction process have fallen short of achieving sufficient accuracy and usability. With the release of large language models (LLMs), new possibilities have emerged to…
Descriptors: Data Collection, Evidence, Synthesis, Language Processing
Adrian Adams; Lauren Barth-Cohen – CBE - Life Sciences Education, 2024
In undergraduate research settings, students are likely to encounter anomalous data, that is, data that do not meet their expectations. Most of the research that directly or indirectly captures the role of anomalous data in research settings uses post-hoc reflective interviews or surveys. These data collection approaches focus on recall of past…
Descriptors: Undergraduate Students, Physics, Science Instruction, Laboratory Experiments
Mihyun Son; Minsu Ha – Education and Information Technologies, 2025
Digital literacy is essential for scientific literacy in a digital world. Although the NGSS Practices include many activities that require digital literacy, most studies have examined digital literacy from a generic perspective rather than a curricular context. This study aimed to develop a self-report tool to measure elements of digital literacy…
Descriptors: Test Construction, Measures (Individuals), Digital Literacy, Scientific Literacy
Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025
Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…
Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes
Catalina Patricia Morales-Murillo; Manuel Pacheco-Molero; Irene León-Estrada; Rosa Fernández-Valero; Mónica Gutiérrez Ortega; R. A. McWilliam – Early Childhood Education Journal, 2025
This study analyzed the fit of data collected with the Measure of Engagement, Independence, and Social Relationships for 3- to 5-year-olds (MEISR 3-to-5-years-old) to a proposed theoretical model based on the cross-walk of MEISR 3-to-5-years-old items and codes from 7 chapters of the Activities and Participation component of the International…
Descriptors: Foreign Countries, Early Childhood Education, Intervention, Preschool Children
Öz, Serap; Özdemir, Ali – International Journal of Contemporary Educational Research, 2022
The purpose of this study is to develop a valid and reliable Likert-type scale that can be used to measure the data literacy skills of educators. In the development process of the scale, after reviewing the relevant literature, a pool of 130 items was designed and presented to experts for review. After the evaluation of experts, the…
Descriptors: Likert Scales, Test Construction, Construct Validity, Test Reliability
Foster, Robert C. – Educational and Psychological Measurement, 2021
This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…
Descriptors: Test Reliability, Data, Computation, Mathematical Formulas
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024
There can be many reasons why students fail to answer summative tests correctly in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…
Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education
Yuting Han; Zhehan Jiang; Lingling Xu; Fen Cai – AERA Online Paper Repository, 2024
To address the computational constraints of parameter estimation in the polytomous Cognitive Diagnosis Model (pCDM) in large-scale high data volume situations, this study proposes two two-stage polytomous attribute estimation methods: P_max and P_linear. The effects of the two-stage methods were studied via a Monte Carlo simulation study, and the…
Descriptors: Medical Education, Licensing Examinations (Professions), Measurement Techniques, Statistical Data
Elizabeth B. Vaughan; A. Montoya-Cowan; Jack Barbera – Chemistry Education Research and Practice, 2024
The Meaningful Learning in the Laboratory Instrument (MLLI) was designed to measure students' expectations before and after their laboratory courses and experiences. Although the MLLI has been used in various studies and laboratory environments to investigate students' cognitive and affective laboratory expectations, the authors of the instrument…
Descriptors: Test Validity, Test Reliability, Expectation, Measures (Individuals)
van Alphen, Thijmen; Jak, Suzanne; Jansen in de Wal, Joost; Schuitema, Jaap; Peetsma, Thea – Applied Measurement in Education, 2022
Intensive longitudinal data is increasingly used to study state-like processes such as changes in daily stress. Measures aimed at collecting such data require the same level of scrutiny regarding scale reliability as traditional questionnaires. The most prevalent methods used to assess reliability of intensive longitudinal measures are based on…
Descriptors: Test Reliability, Measures (Individuals), Anxiety, Data Collection
Seohyeon Choi; Kristen L. McMaster; Erica S. Lembke; Manjary Guha – Grantee Submission, 2024
Teachers' knowledge and skills about data-based instruction (DBI) can influence their self-efficacy and their implementation of DBI with fidelity, ultimately playing a crucial role in improving student outcomes. The purpose of this brief report is to provide evidence for the technical adequacy of a measure of DBI knowledge and skills in writing by…
Descriptors: Data Use, Writing Instruction, Knowledge Level, Elementary School Teachers
Seohyeon Choi; Kristen L. McMaster; Erica S. Lembke; Manjary Guha – Assessment for Effective Intervention, 2024
Teachers' knowledge and skills about data-based instruction (DBI) can influence their self-efficacy and their implementation of DBI with fidelity, ultimately playing a crucial role in improving student outcomes. The purpose of this brief report is to provide evidence for the technical adequacy of a measure of DBI knowledge and skills in writing by…
Descriptors: Data Use, Writing Instruction, Knowledge Level, Elementary School Teachers

