Publication Date
| In 2026 | 0 |
| Since 2025 | 14 |
| Since 2022 (last 5 years) | 81 |
| Since 2017 (last 10 years) | 356 |
| Since 2007 (last 20 years) | 973 |
Descriptor
| Statistical Analysis | 1095 |
| Computation | 1087 |
| Models | 217 |
| Comparative Analysis | 178 |
| Foreign Countries | 154 |
| Correlation | 152 |
| Sample Size | 145 |
| Regression (Statistics) | 126 |
| Error of Measurement | 119 |
| Scores | 115 |
| Effect Size | 110 |
| More ▼ | |
Source
Author
| Raykov, Tenko | 22 |
| Marcoulides, George A. | 13 |
| Schochet, Peter Z. | 11 |
| Dong, Nianbo | 9 |
| Moses, Tim | 9 |
| Cho, Sun-Joo | 8 |
| Konstantopoulos, Spyros | 8 |
| Spybrook, Jessaca | 8 |
| Reardon, Sean F. | 7 |
| Shieh, Gwowen | 7 |
| Zhang, Zhiyong | 7 |
| More ▼ | |
Publication Type
Education Level
Location
| Canada | 15 |
| Australia | 13 |
| Turkey | 13 |
| Texas | 12 |
| California | 10 |
| Germany | 10 |
| Netherlands | 9 |
| United Kingdom | 9 |
| United States | 9 |
| Massachusetts | 7 |
| Pennsylvania | 7 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019
M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parametric logistic model and the three-parametric logistic model with a common guessing parameter. The Type I error rate and…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics
Choi, Jinnie – Journal of Educational and Behavioral Statistics, 2017
This article reviews PROC IRT, which was added to Statistical Analysis Software in 2014. We provide an introductory overview of a free version of SAS, describe what PROC IRT offers for item response theory (IRT) analysis and how one can use PROC IRT, and discuss how other SAS macros and procedures may compensate the IRT functionalities of PROC IRT.
Descriptors: Item Response Theory, Computer Software, Statistical Analysis, Computation
Zhou, Sherry; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2020
The semi-generalized partial credit model (Semi-GPCM) has been proposed as a unidimensional modeling method for handling not applicable scale responses and neutral scale responses, and it has been suggested that the model may be of use in handling missing data in scale items. The purpose of this study is to evaluate the ability of the…
Descriptors: Models, Statistical Analysis, Response Style (Tests), Test Items
No, Unkyung; Hong, Sehee – Educational and Psychological Measurement, 2018
The purpose of the present study is to compare performances of mixture modeling approaches (i.e., one-step approach, three-step maximum-likelihood approach, three-step BCH approach, and LTB approach) based on diverse sample size conditions. To carry out this research, two simulation studies were conducted with two different models, a latent class…
Descriptors: Sample Size, Classification, Comparative Analysis, Statistical Analysis
Taylor, Joseph A.; Kowalski, Susan M.; Polanin, Joshua R.; Askinas, Karen; Stuhlsatz, Molly A. M.; Wilson, Christopher D.; Tipton, Elizabeth; Wilson, Sandra Jo – AERA Open, 2018
A priori power analyses allow researchers to estimate the number of participants needed to detect the effects of an intervention. However, power analyses are only as valid as the parameter estimates used. One such parameter, the expected effect size, can vary greatly depending on several study characteristics, including the nature of the…
Descriptors: Science Education, Statistical Analysis, Effect Size, Intervention
Nestler, Steffen – Journal of Educational and Behavioral Statistics, 2018
The social relations model (SRM) is a mathematical model that can be used to analyze interpersonal judgment and behavior data. Typically, the SRM is applied to one (i.e., univariate SRM) or two variables (i.e., bivariate SRM), and parameter estimates are obtained by employing an analysis of variance method. Here, we present an extension of the SRM…
Descriptors: Mathematical Models, Interpersonal Relationship, Maximum Likelihood Statistics, Computation
Uanhoro, James Ohisei; O'Connell, Ann A. – AERA Online Paper Repository, 2018
There have been increasing calls for applied researchers to see and utilize effect sizes as the primary outcomes of their research. However, this sometimes places a methodological burden on researchers whose primary interests are substantive. Motivated by a desire to help applied researchers better report effect sizes and their confidence…
Descriptors: Effect Size, Computation, Statistical Analysis, Hierarchical Linear Modeling
Swan, Daniel M.; Pustejovsky, James E. – Grantee Submission, 2018
Single-case designs are a class of repeated measures experiments used to evaluate the effects of interventions for small or specialized populations, such as individuals with low-incidence disabilities. There has been growing interest in systematic reviews and syntheses of evidence from single-case designs, but there remains a need to further…
Descriptors: Research Design, Intervention, Effect Size, Statistical Analysis
Jacobson, Michael J.; Levin, James A.; Kapur, Manu – Educational Researcher, 2019
Education is a complex system, which has conceptual and methodological implications for education research and policy. In this article, an overview is first provided of the Complex Systems Conceptual Framework for Learning (CSCFL), which consists of a set of conceptual perspectives that are generally shared by educational complex systems,…
Descriptors: Systems Approach, Group Behavior, Behavior, Educational Research
Yang, Shitao; Black, Ken – Teaching Statistics: An International Journal for Teachers, 2019
Summary Employing a Wald confidence interval to test hypotheses about population proportions could lead to an increase in Type I or Type II errors unless the hypothesized value, p0, is used in computing its standard error rather than the sample proportion. Whereas the Wald confidence interval to estimate a population proportion uses the sample…
Descriptors: Error Patterns, Evaluation Methods, Error of Measurement, Measurement Techniques
Leite, Walter L.; Aydin, Burak; Gurel, Sungur – Journal of Experimental Education, 2019
This Monte Carlo simulation study compares methods to estimate the effects of programs with multiple versions when assignment of individuals to program version is not random. These methods use generalized propensity scores, which are predicted probabilities of receiving a particular level of the treatment conditional on covariates, to remove…
Descriptors: Probability, Weighted Scores, Monte Carlo Methods, Statistical Bias
Cominole, Melissa; Ritchie, Nichole Smith; Cooney, Jennifer – National Center for Education Statistics, 2021
This publication describes the methods and procedures used for the 2008/18 Baccalaureate and Beyond Longitudinal Study (B&B:08/18). The B&B graduates, who completed the requirements for a bachelor's degree during the 2007-08 academic year, were first surveyed as part of the 2008 National Postsecondary Student Aid Study (NPSAS:08), and then…
Descriptors: Bachelors Degrees, College Graduates, Longitudinal Studies, Data Collection
Paul T. von Hippel; Laura Bellows – Annenberg Institute for School Reform at Brown University, 2020
At least sixteen US states have taken steps toward holding teacher preparation programs (TPPs) accountable for teacher value-added to student test scores. Yet it is unclear whether teacher quality differences between TPPs are large enough to make an accountability system worthwhile. Several statistical practices can make differences between TPPs…
Descriptors: Teacher Effectiveness, Teacher Education Programs, Scores, Accountability
Li, Wei; Konstantopoulos, Spyros – Journal of Experimental Education, 2019
Education experiments frequently assign students to treatment or control conditions within schools. Longitudinal components added in these studies (e.g., students followed over time) allow researchers to assess treatment effects in average rates of change (e.g., linear or quadratic). We provide methods for a priori power analysis in three-level…
Descriptors: Research Design, Statistical Analysis, Sample Size, Effect Size
Raykov, Tenko; Dimitrov, Dimiter M.; Marcoulides, George A.; Harrison, Michael – Educational and Psychological Measurement, 2019
Building on prior research on the relationships between key concepts in item response theory and classical test theory, this note contributes to highlighting their important and useful links. A readily and widely applicable latent variable modeling procedure is discussed that can be used for point and interval estimation of the individual person…
Descriptors: True Scores, Item Response Theory, Test Items, Test Theory

Peer reviewed
Direct link
