Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 28 |
Descriptor
Comparative Analysis | 28 |
Statistical Analysis | 28 |
Computation | 18 |
Sample Size | 7 |
Models | 6 |
Hierarchical Linear Modeling | 5 |
Regression (Statistics) | 5 |
Simulation | 5 |
Statistical Bias | 5 |
Effect Size | 4 |
Foreign Countries | 4 |
More ▼ |
Source
Journal of Educational and… | 28 |
Author
Lüdtke, Oliver | 2 |
Robitzsch, Alexander | 2 |
Aseltine, Robert H., Jr. | 1 |
Avi Feller | 1 |
Azen, Razia | 1 |
Benjamin Lu | 1 |
Beretvas, S. Natasha | 1 |
Bolsinova, Maria | 1 |
Bonnet, Gerard | 1 |
Botella, Juan | 1 |
Béguin, Anton A. | 1 |
More ▼ |
Publication Type
Journal Articles | 28 |
Reports - Research | 19 |
Reports - Evaluative | 6 |
Reports - Descriptive | 3 |
Education Level
Secondary Education | 5 |
Adult Education | 1 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 1 | 1 |
High Schools | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 3 |
Center for Epidemiologic… | 1 |
Early Childhood Longitudinal… | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Joo, Seang-Hwane; Wang, Yan; Ferron, John; Beretvas, S. Natasha; Moeyaert, Mariola; Van Den Noortgate, Wim – Journal of Educational and Behavioral Statistics, 2022
Multiple baseline (MB) designs are becoming more prevalent in educational and behavioral research, and as they do, there is growing interest in combining effect size estimates across studies. To further refine the meta-analytic methods of estimating the effect, this study developed and compared eight alternative methods of estimating intervention…
Descriptors: Meta Analysis, Effect Size, Computation, Statistical Analysis
Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022
One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…
Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis
Crompvoets, Elise A. V.; Béguin, Anton A.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2020
Pairwise comparison is becoming increasingly popular as a holistic measurement method in education. Unfortunately, many comparisons are required for reliable measurement. To reduce the number of required comparisons, we developed an adaptive selection algorithm (ASA) that selects the most informative comparisons while taking the uncertainty of the…
Descriptors: Comparative Analysis, Statistical Analysis, Mathematics, Measurement
Benjamin Lu; Eli Ben-Michael; Avi Feller; Luke Miratrix – Journal of Educational and Behavioral Statistics, 2023
In multisite trials, learning about treatment effect variation across sites is critical for understanding where and for whom a program works. Unadjusted comparisons, however, capture "compositional" differences in the distributions of unit-level features as well as "contextual" differences in site-level features, including…
Descriptors: Statistical Analysis, Statistical Distributions, Program Implementation, Comparative Analysis
Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2022
This article develops new closed-form variance expressions for power analyses for commonly used difference-in-differences (DID) and comparative interrupted time series (CITS) panel data estimators. The main contribution is to incorporate variation in treatment timing into the analysis. The power formulas also account for other key design features…
Descriptors: Comparative Analysis, Statistical Analysis, Sample Size, Measurement Techniques
Erps, Ryan C.; Noguchi, Kimihiro – Journal of Educational and Behavioral Statistics, 2020
A new two-sample test for comparing variability measures is proposed. To make the test robust and powerful, a new modified structural zero removal method is applied to the Brown-Forsythe transformation. The t-test-based statistic allows results to be expressed as the ratio of mean absolute deviations from median. Extensive simulation study…
Descriptors: Statistical Analysis, Comparative Analysis, Robustness (Statistics), Sample Size
Mistler, Stephen A.; Enders, Craig K. – Journal of Educational and Behavioral Statistics, 2017
Multiple imputation methods can generally be divided into two broad frameworks: joint model (JM) imputation and fully conditional specification (FCS) imputation. JM draws missing values simultaneously for all incomplete variables using a multivariate distribution, whereas FCS imputes variables one at a time from a series of univariate conditional…
Descriptors: Statistical Analysis, Comparative Analysis, Hierarchical Linear Modeling, Computer Simulation
Hong, Guanglei; Qin, Xu; Yang, Fan – Journal of Educational and Behavioral Statistics, 2018
Through a sensitivity analysis, the analyst attempts to determine whether a conclusion of causal inference could be easily reversed by a plausible violation of an identification assumption. Analytic conclusions that are harder to alter by such a violation are expected to add a higher value to scientific knowledge about causality. This article…
Descriptors: Statistical Inference, Probability, Statistical Bias, Statistical Analysis
Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2018
Multiple imputation (MI) can be used to address missing data at Level 2 in multilevel research. In this article, we compare joint modeling (JM) and the fully conditional specification (FCS) of MI as well as different strategies for including auxiliary variables at Level 1 using either their manifest or their latent cluster means. We show with…
Descriptors: Statistical Analysis, Data, Comparative Analysis, Hierarchical Linear Modeling
Savalei, Victoria; Rhemtulla, Mijke – Journal of Educational and Behavioral Statistics, 2017
In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately…
Descriptors: Computation, Statistical Analysis, Test Items, Maximum Likelihood Statistics
Magis, David; Tuerlinckx, Francis; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2015
This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…
Descriptors: Test Bias, Test Items, Regression (Statistics), Scores
Tipton, Elizabeth – Journal of Educational and Behavioral Statistics, 2014
Although a large-scale experiment can provide an estimate of the average causal impact for a program, the sample of sites included in the experiment is often not drawn randomly from the inference population of interest. In this article, we provide a generalizability index that can be used to assess the degree of similarity between the sample of…
Descriptors: Experiments, Comparative Analysis, Experimental Groups, Generalization
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016
Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit
Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016
Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…
Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)
Jan, Show-Li; Shieh, Gwowen – Journal of Educational and Behavioral Statistics, 2014
The analysis of variance (ANOVA) is one of the most frequently used statistical analyses in practical applications. Accordingly, the single and multiple comparison procedures are frequently applied to assess the differences among mean effects. However, the underlying assumption of homogeneous variances may not always be tenable. This study…
Descriptors: Sample Size, Statistical Analysis, Computation, Probability
Previous Page | Next Page »
Pages: 1 | 2