Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 72 |
| Since 2017 (last 10 years) | 161 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Evaluation Methods | 966 |
| Classification | 959 |
| Foreign Countries | 174 |
| Models | 160 |
| Student Evaluation | 130 |
| Comparative Analysis | 106 |
| Higher Education | 96 |
| Evaluation Criteria | 92 |
| Elementary Secondary Education | 85 |
| Teaching Methods | 61 |
| Definitions | 60 |
| More ▼ | |
Source
Author
| Fuchs, Lynn S. | 5 |
| Timmons, Vianne | 5 |
| Fuchs, Douglas | 4 |
| Westphal, Laurie E. | 4 |
| Alkin, Marvin C. | 3 |
| Bachofer, Karen | 3 |
| Betts, Julian | 3 |
| Compton, Donald L. | 3 |
| Gresham, Frank M. | 3 |
| Hayes, Joseph | 3 |
| Hill, Laura | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 24 |
| Researchers | 24 |
| Teachers | 11 |
| Administrators | 7 |
| Counselors | 4 |
| Students | 4 |
| Policymakers | 3 |
| Parents | 2 |
| Community | 1 |
Location
| United Kingdom | 26 |
| Australia | 24 |
| Canada | 23 |
| United States | 15 |
| China | 8 |
| United Kingdom (England) | 8 |
| California | 7 |
| Israel | 7 |
| Netherlands | 7 |
| Germany | 6 |
| Florida | 5 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Putnikovic, Marko; Jovanovic, Jelena – IEEE Transactions on Learning Technologies, 2023
Automatic grading of short answers is an important task in computer-assisted assessment (CAA). Recently, embeddings, as semantic-rich textual representations, have been increasingly used to represent short answers and predict the grade. Despite the recent trend of applying embeddings in automatic short answer grading (ASAG), there are no…
Descriptors: Automation, Computer Assisted Testing, Grading, Natural Language Processing
Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023
Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…
Descriptors: Testing, Computation, Classification, Accuracy
Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022
Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…
Descriptors: Validity, Performance, Standards, Multivariate Analysis
Chih-Hsuan Chen; Chia-Ru Chung; Hsuan-Yu Yang; Shih-Ching Yeh; Eric Hsiao-Kuang Wu; Hsin-Jung Ting – IEEE Transactions on Learning Technologies, 2024
Possible symptoms of intellectual disability (ID) include delayed physical development that becomes more pronounced as the disability progresses, delayed development of gross and fine motor skills, sensory perception problems, and difficulty grasping the integrity of objects. Although there is no cure or reversal, research has shown that extensive…
Descriptors: Intellectual Disability, Disability Identification, Simulated Environment, Computer Simulation
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Daniel McNeish; Patrick D. Manapat – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A recent review found that 11% of published factor models are hierarchical models with second-order factors. However, dedicated recommendations for evaluating hierarchical model fit have yet to emerge. Traditional benchmarks like RMSEA <0.06 or CFI >0.95 are often consulted, but they were never intended to generalize to hierarchical models.…
Descriptors: Factor Analysis, Goodness of Fit, Hierarchical Linear Modeling, Benchmarking
Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025
This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…
Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism
Stephanie Fuchs; Alexandra Werth; Cristóbal Méndez; Jonathan Butcher – Journal of Engineering Education, 2025
Background: High-quality feedback is crucial for academic success, driving student motivation and engagement while research explores effective delivery and student interactions. Advances in artificial intelligence (AI), particularly natural language processing (NLP), offer innovative methods for analyzing complex qualitative data such as feedback…
Descriptors: Artificial Intelligence, Training, Data Analysis, Natural Language Processing
Jihong Zhang – ProQuest LLC, 2022
Recently, Bayesian diagnostic classification modeling has been becoming popular in health psychology, education, and sociology. Typically information criteria are used for model selection when researchers want to choose the best model among alternative models. In Bayesian estimation, posterior predictive checking is a flexible Bayesian model…
Descriptors: Bayesian Statistics, Cognitive Measurement, Models, Classification
Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024
RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…
Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics
Melina Verger; Chunyang Fan; Sébastien Lallé; François Bouchet; Vanda Luengo – Journal of Educational Data Mining, 2024
Predictive student models are increasingly used in learning environments due to their ability to enhance educational outcomes and support stakeholders in making informed decisions. However, predictive models can be biased and produce unfair outcomes, leading to potential discrimination against certain individuals and harmful long-term…
Descriptors: Algorithms, Prediction, Bias, Classification
Region 15 Comprehensive Center, 2024
This brief emphasizes the importance of English learner reclassification in California, where over 19 percent of students are classified as English learners. It outlines the high-stakes nature of this process, particularly under the Every Student Succeeds Act (ESSA), which mandates uniform criteria for exiting English learner programs. The four…
Descriptors: English Learners, Classification, Language Proficiency, Teacher Attitudes
Liang, Xinya; Cao, Chunhua – Journal of Experimental Education, 2023
To evaluate multidimensional factor structure, a popular method that combines features of confirmatory and exploratory factor analysis is Bayesian structural equation modeling with small-variance normal priors (BSEM-N). This simulation study evaluated BSEM-N as a variable selection and parameter estimation tool in factor analysis with sparse…
Descriptors: Factor Analysis, Bayesian Statistics, Structural Equation Models, Simulation
Van Camp, Carole M.; Batchelder, Sydney R.; Irwin Helvey, Casey – Journal of Applied Behavior Analysis, 2022
Children should engage in 1 hr/day of moderate-to-vigorous physical activity (MVPA) that results in increased heart rates (HRs) (CDC, 2022). However, precise individualized HR criteria for MVPA are not provided, and it is unclear whether observed behaviors classified as MVPA are associated with elevated HRs indicative of MVPA. The current study…
Descriptors: Metabolism, Physical Activity Level, Identification, Classification
Lishan Zhang; Linyu Deng; Sixv Zhang; Ling Chen – IEEE Transactions on Learning Technologies, 2024
With the popularity of online one-to-one tutoring, there are emerging concerns about the quality and effectiveness of this kind of tutoring. Although there are some evaluation methods available, they are heavily relied on manual coding by experts, which is too costly. Therefore, using machine learning to predict instruction quality automatically…
Descriptors: Automation, Classification, Artificial Intelligence, Tutoring

Peer reviewed
Direct link
