San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022
The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…
Descriptors: Tests, Scores, Statistical Analysis, Models
Sinharay, Sandip; Johnson, Matthew S. – Journal of Educational and Behavioral Statistics, 2021
Score differencing is one of the six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…
Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis
Kuijpers, Renske E.; Visser, Ingmar; Molenaar, Dylan – Journal of Educational and Behavioral Statistics, 2021
Mixture models have been developed to enable detection of within-subject differences in responses and response times to psychometric test items. To enable mixture modeling of both responses and response times, a distributional assumption is needed for the within-state response time distribution. Since violations of the assumed response time…
Descriptors: Test Items, Responses, Reaction Time, Models
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2019
When equating two test forms, the equated scores will be biased if the test groups differ in ability. To adjust for the ability imbalance between nonequivalent groups, a set of common items is often used. When no common items are available, it has been suggested to use covariates correlated with the test scores instead. In this article, we reduce…
Descriptors: Equated Scores, Test Items, Probability, College Entrance Examinations
Nguyen, Trang Quynh; Stuart, Elizabeth A. – Journal of Educational and Behavioral Statistics, 2020
We address measurement error bias in propensity score (PS) analysis due to covariates that are latent variables. In the setting where latent covariate X is measured via multiple error-prone items W, PS analysis using several proxies for X--the W items themselves, a summary score (mean/sum of the items), or the conventional factor score (i.e.,…
Descriptors: Error of Measurement, Statistical Bias, Error Correction, Probability
Hong, Guanglei; Qin, Xu; Yang, Fan – Journal of Educational and Behavioral Statistics, 2018
Through a sensitivity analysis, the analyst attempts to determine whether a conclusion of causal inference could be easily reversed by a plausible violation of an identification assumption. Analytic conclusions that are harder to alter by such a violation are expected to add a higher value to scientific knowledge about causality. This article…
Descriptors: Statistical Inference, Probability, Statistical Bias, Statistical Analysis
Qin, Xu; Hong, Guanglei – Journal of Educational and Behavioral Statistics, 2017
When a multisite randomized trial reveals between-site variation in program impact, methods are needed for further investigating heterogeneous mediation mechanisms across the sites. We conceptualize and identify a joint distribution of site-specific direct and indirect effects under the potential outcomes framework. A method-of-moments procedure…
Descriptors: Randomized Controlled Trials, Hierarchical Linear Modeling, Statistical Analysis, Probability
Feller, Avi; Mealli, Fabrizia; Miratrix, Luke – Journal of Educational and Behavioral Statistics, 2017
Researchers addressing posttreatment complications in randomized trials often turn to principal stratification to define relevant assumptions and quantities of interest. One approach for the subsequent estimation of causal effects in this framework is to use methods based on the "principal score," the conditional probability of belonging…
Descriptors: Scores, Probability, Computation, Program Evaluation
Kovalchik, Stephanie A.; Martino, Steven C.; Collins, Rebecca L.; Shadel, William G.; D'Amico, Elizabeth J.; Becker, Kirsten – Journal of Educational and Behavioral Statistics, 2018
Ecological momentary assessment (EMA) is a popular assessment method in psychology that aims to capture events, emotions, and cognitions in real time, usually repeatedly throughout the day. Because EMA typically involves more intensive monitoring than traditional assessment methods, missing data are commonly an issue and this missingness may bias…
Descriptors: Probability, Statistical Bias, Holistic Approach, Evaluation Methods
Keller, Bryan; Tipton, Elizabeth – Journal of Educational and Behavioral Statistics, 2016
In this article, we review four software packages for implementing propensity score analysis in R: "Matching," "MatchIt," "PSAgraphics," and "twang." After briefly discussing essential elements for propensity score analysis, we apply each package to a data set from the Early Childhood Longitudinal Study in order to estimate the…
Descriptors: Computer Software, Probability, Statistical Analysis, Longitudinal Studies
Tipton, Elizabeth – Journal of Educational and Behavioral Statistics, 2014
Although a large-scale experiment can provide an estimate of the average causal impact for a program, the sample of sites included in the experiment is often not drawn randomly from the inference population of interest. In this article, we provide a generalizability index that can be used to assess the degree of similarity between the sample of…
Descriptors: Experiments, Comparative Analysis, Experimental Groups, Generalization
Sweet, Tracy M. – Journal of Educational and Behavioral Statistics, 2015
Social networks in education commonly involve some form of grouping, such as friendship cliques or teacher departments, and blockmodels are a type of statistical social network model that accommodates these groupings, or blocks, by assuming different within-group tie probabilities than between-group tie probabilities. We describe a class of models,…
Descriptors: Social Networks, Statistical Analysis, Probability, Models
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2015
An equating procedure for a testing program with evolving distribution of examinee profiles is developed. No anchor is available because the original scoring scheme was based on expert judgment of the item difficulties. Pairs of examinees from two administrations are formed by matching on coarsened propensity scores derived from a set of…
Descriptors: Equated Scores, Testing Programs, College Entrance Examinations, Scoring
Jan, Show-Li; Shieh, Gwowen – Journal of Educational and Behavioral Statistics, 2014
The analysis of variance (ANOVA) is one of the most frequently used statistical analyses in practical applications. Accordingly, the single and multiple comparison procedures are frequently applied to assess the differences among mean effects. However, the underlying assumption of homogeneous variances may not always be tenable. This study…
Descriptors: Sample Size, Statistical Analysis, Computation, Probability
Hong, Guanglei; Deutsch, Jonah; Hill, Heather D. – Journal of Educational and Behavioral Statistics, 2015
Conventional methods for mediation analysis generate biased results when the mediator--outcome relationship depends on the treatment condition. This article shows how the ratio-of-mediator-probability weighting (RMPW) method can be used to decompose total effects into natural direct and indirect effects in the presence of treatment-by-mediator…
Descriptors: Weighted Scores, Probability, Statistical Analysis, Interaction