Publication Date
| In 2026 | 0 |
| Since 2025 | 9 |
| Since 2022 (last 5 years) | 35 |
Descriptor
| Evaluation Methods | 35 |
| Programming Languages | 35 |
| Computer Science Education | 10 |
| Computer Software | 10 |
| Models | 8 |
| Accuracy | 7 |
| Foreign Countries | 7 |
| Bayesian Statistics | 6 |
| Programming | 6 |
| Simulation | 6 |
| Algorithms | 5 |
| More ▼ | |
Source
Author
| Barnes, Tiffany | 2 |
| Chi, Min | 2 |
| Pere J. Ferrando | 2 |
| Shi, Yang | 2 |
| Tan, Teck Kiang | 2 |
| Achutti, Camila F. | 1 |
| Aleksandar D. Kovacevic | 1 |
| Ana Hernández-Dorado | 1 |
| Anders Sjöberg | 1 |
| Anna-Bettina Haidich | 1 |
| Asmaa Bengueddach | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 28 |
| Reports - Research | 25 |
| Reports - Descriptive | 5 |
| Speeches/Meeting Papers | 4 |
| Reports - Evaluative | 3 |
| Tests/Questionnaires | 2 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
Education Level
| Higher Education | 10 |
| Postsecondary Education | 10 |
| Secondary Education | 6 |
| High Schools | 3 |
| Junior High Schools | 3 |
| Middle Schools | 3 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Joyce M. W. Moonen-van Loon; Jeroen Donkers – Practical Assessment, Research & Evaluation, 2025
The reliability of assessment tools is critical for accurately monitoring student performance in various educational contexts. When multiple assessments are combined to form an overall evaluation, each assessment serves as a data point contributing to the student's performance within a broader educational framework. Determining composite…
Descriptors: Programming Languages, Reliability, Evaluation Methods, Student Evaluation
Suppanut Sriutaisuk; Yu Liu; Seungwon Chung; Hanjoe Kim; Fei Gu – Educational and Psychological Measurement, 2025
The multiple imputation two-stage (MI2S) approach holds promise for evaluating the model fit of structural equation models for ordinal variables with multiply imputed data. However, previous studies only examined the performance of MI2S-based residual-based test statistics. This study extends previous research by examining the performance of two…
Descriptors: Structural Equation Models, Error of Measurement, Programming Languages, Goodness of Fit
Pere J. Ferrando; Ana Hernández-Dorado; Urbano Lorenzo-Seva – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A frequent criticism of exploratory factor analysis (EFA) is that it does not allow correlated residuals to be modelled, while they can be routinely specified in the confirmatory (CFA) model. In this article, we propose an EFA approach in which both the common factor solution and the residual matrix are unrestricted (i.e., the correlated residuals…
Descriptors: Correlation, Factor Analysis, Models, Goodness of Fit
Xiao Liu; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
In psychology, researchers are often interested in testing hypotheses about mediation, such as testing the presence of a mediation effect of a treatment (e.g., intervention assignment) on an outcome via a mediator. An increasingly popular approach to testing hypotheses is the Bayesian testing approach with Bayes factors (BFs). Despite the growing…
Descriptors: Sample Size, Bayesian Statistics, Programming Languages, Simulation
Christina Glasauer; Martin K. Yeh; Lois Anne DeLong; Yu Yan; Yanyan Zhuang – Computer Science Education, 2025
Background and Context: Feedback on one's progress is essential to new programming language learners, particularly in out-of-classroom settings. Though many study materials offer assessment mechanisms, most do not examine the accuracy of the feedback they deliver, nor give evidence on its validity. Objective: We investigate the potential use of a…
Descriptors: Novices, Computer Science Education, Programming, Accuracy
Bougioukas, Konstantinos I.; Diakonidis, Theodoros; Mavromanoli, Anna C.; Haidich, Anna-Bettina – Research Synthesis Methods, 2023
An overview of reviews aims to collect, assess, and synthesize evidence from multiple systematic reviews (SRs) on a specific topic using rigorous and reproducible methods. An important methodological challenge in conducting an overview of reviews is the management of overlapping data due to the inclusion of the same primary studies in SRs. We…
Descriptors: Programming Languages, Open Source Technology, Evaluation Methods, Evidence
Konstantinos I. Bougioukas; Paschalis Karakasis; Konstantinos Pamporis; Emmanouil Bouras; Anna-Bettina Haidich – Research Synthesis Methods, 2024
Systematic reviews (SRs) have an important role in the healthcare decision-making practice. Assessing the overall confidence in the results of SRs using quality assessment tools, such as "A MeaSurement Tool to Assess Systematic Reviews 2" (AMSTAR 2), is crucial since not all SRs are conducted using the most rigorous methods. In this…
Descriptors: Programming Languages, Research Methodology, Decision Making, Medical Research
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2023
Researchers often have hypotheses concerning the state of affairs in the population from which they sampled their data to compare group means. The classical frequentist approach provides one way of carrying out hypothesis testing using ANOVA to state the null hypothesis that there is no difference in the means and proceed with multiple comparisons…
Descriptors: Comparative Analysis, Hypothesis Testing, Statistical Analysis, Guidelines
Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025
This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…
Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism
Tan, Teck Kiang – Practical Assessment, Research & Evaluation, 2022
Power analysis based on the analytical t-test is an important aspect of a research study to determine the sample size required to detect the effect for the comparison of two means. The current paper presents a reader-friendly procedure for carrying out the t-test power analysis using the various R add-on packages. While there is a growing of R…
Descriptors: Programming Languages, Sample Size, Bayesian Statistics, Intervention
Kane Meissel; Esther S. Yao – Practical Assessment, Research & Evaluation, 2024
Effect sizes are important because they are an accessible way to indicate the practical importance of observed associations or differences. Standardized mean difference (SMD) effect sizes, such as Cohen's d, are widely used in education and the social sciences -- in part because they are relatively easy to calculate. However, SMD effect sizes…
Descriptors: Computer Software, Programming Languages, Effect Size, Correlation
Melina Verger; Chunyang Fan; Sébastien Lallé; François Bouchet; Vanda Luengo – Journal of Educational Data Mining, 2024
Predictive student models are increasingly used in learning environments due to their ability to enhance educational outcomes and support stakeholders in making informed decisions. However, predictive models can be biased and produce unfair outcomes, leading to potential discrimination against certain individuals and harmful long-term…
Descriptors: Algorithms, Prediction, Bias, Classification
Tamas Balla; Sandor Kiraly; Roland Kiraly – Discover Education, 2025
Educational games have gained widespread interest among teachers and researchers across various fields due to their capacity to engage students, foster active participation, and improve learning outcomes. In the context of computer programming, which demands significant cognitive effort, the use of educational games has grown substantially. While…
Descriptors: Educational Games, Gamification, Programming, Programming Languages
Van Lissa, Caspar J.; van Erp, Sara; Clapper, Eli-Boaz – Research Synthesis Methods, 2023
When meta-analyzing heterogeneous bodies of literature, meta-regression can be used to account for potentially relevant between-studies differences. A key challenge is that the number of candidate moderators is often high relative to the number of studies. This introduces risks of overfitting, spurious results, and model non-convergence. To…
Descriptors: Bayesian Statistics, Regression (Statistics), Maximum Likelihood Statistics, Meta Analysis
Erik Hombre Cuevas; Daniel Zaldivar; Marco Perez – International Journal of Information and Communication Technology Education, 2025
The integration of various programming languages into the undergraduate engineering curriculum often occurs without adequate evaluation of their effectiveness within specific disciplines. Recently, Python and MATLAB have garnered significant attention as preferred languages for teaching subjects such as image processing and computer vision.…
Descriptors: Influence of Technology, Technology Uses in Education, Programming Languages, Academic Achievement

Peer reviewed
Direct link
