Showing 1 to 15 of 19 results
Peer reviewed
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022
The limitations of Cohen's κ are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…
Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design
Peer reviewed
Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2024
I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…
Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education
Peer reviewed
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Peer reviewed
Schochet, Peter Z. – Journal of Educational and Behavioral Statistics, 2020
This article discusses estimation of average treatment effects for randomized controlled trials (RCTs) using grouped administrative data to help improve data access. The focus is on design-based estimators, derived using the building blocks of experiments, that are conducive to grouped data for a wide range of RCT designs, including clustered and…
Descriptors: Randomized Controlled Trials, Data Analysis, Research Design, Multivariate Analysis
Peer reviewed
Doval, Eduardo; Delicado, Pedro – Journal of Educational and Behavioral Statistics, 2020
We propose new methods for identifying and classifying aberrant response patterns (ARPs) by means of functional data analysis. These methods take the person response function (PRF) of an individual and compare it with the pattern that would correspond to a generic individual of the same ability according to the item-person response surface. ARPs…
Descriptors: Response Style (Tests), Data Analysis, Identification, Classification
Peer reviewed
Hao, Jiangang; Ho, Tin Kam – Journal of Educational and Behavioral Statistics, 2019
Machine learning is a popular topic in data analysis and modeling. Many different machine learning algorithms have been developed and implemented in a variety of programming languages over the past 20 years. In this article, we first provide an overview of machine learning and clarify its difference from statistical inference. Then, we review…
Descriptors: Artificial Intelligence, Statistical Inference, Data Analysis, Programming Languages
Peer reviewed
Hayes, Timothy – Journal of Educational and Behavioral Statistics, 2019
Multiple imputation is a popular method for addressing data that are presumed to be missing at random. To obtain accurate results, one's imputation model must be congenial to (appropriate for) one's intended analysis model. This article reviews and demonstrates two recent software packages, Blimp and jomo, to multiply impute data in a manner…
Descriptors: Computer Software Evaluation, Computer Software Reviews, Hierarchical Linear Modeling, Data Analysis
Peer reviewed
Slater, Stefan; Joksimovic, Srecko; Kovanovic, Vitomir; Baker, Ryan S.; Gasevic, Dragan – Journal of Educational and Behavioral Statistics, 2017
In recent years, a wide array of tools have emerged for the purposes of conducting educational data mining (EDM) and/or learning analytics (LA) research. In this article, we hope to highlight some of the most widely used, most accessible, and most powerful tools available for the researcher interested in conducting EDM/LA research. We will…
Descriptors: Data Analysis, Data Processing, Computer Uses in Education, Educational Research
Peer reviewed
Vidotto, Davide; Vermunt, Jeroen K.; van Deun, Katrijn – Journal of Educational and Behavioral Statistics, 2018
With this article, we propose using a Bayesian multilevel latent class (BMLC; or mixture) model for the multiple imputation of nested categorical data. Unlike recently developed methods that can only pick up associations between pairs of variables, the multilevel mixture model we propose is flexible enough to automatically deal with complex…
Descriptors: Bayesian Statistics, Multivariate Analysis, Data, Hierarchical Linear Modeling
Peer reviewed
Rousson, Valentin – Journal of Educational and Behavioral Statistics, 2014
It is well known that dichotomizing continuous data has the effect of decreasing statistical power when the goal is to test for a statistical association between two variables. Modern researchers, however, are focusing not only on statistical significance but also on an estimation of the "effect size" (i.e., the strength of association…
Descriptors: Effect Size, Correlation, Statistical Analysis, Data
Peer reviewed
Zhang, Jinming – Journal of Educational and Behavioral Statistics, 2012
The impact of uncertainty about item parameters on test information functions is investigated. The information function of a test is one of the most important tools in item response theory (IRT). Inaccuracy in the estimation of test information can have substantial consequences on data analyses based on IRT. In this article, the major part (called…
Descriptors: Item Response Theory, Tests, Accuracy, Data Analysis
Peer reviewed
Moses, Tim – Journal of Educational and Behavioral Statistics, 2008
Equating functions are supposed to be population invariant, meaning that the choice of subpopulation used to compute the equating function should not matter. The extent to which equating functions are population invariant is typically assessed in terms of practical difference criteria that do not account for equating functions' sampling…
Descriptors: Equated Scores, Error of Measurement, Sampling, Evaluation Methods
Peer reviewed
von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2007
Reporting methods used in large-scale assessments such as the National Assessment of Educational Progress (NAEP) rely on latent regression models. To fit the latent regression model using the maximum likelihood estimation technique, multivariate integrals must be evaluated. In the computer program MGROUP used by the Educational Testing Service for…
Descriptors: Simulation, Computer Software, Sampling, Data Analysis
Peer reviewed
Rogosa, David; Saner, Hilary – Journal of Educational and Behavioral Statistics, 1995
Longitudinal panel data examples are used to illustrate estimation methods for individual growth curve models. These examples demonstrate issues and concerns in the application of hierarchical modeling estimation, specifically the hierarchical linear models of A. S. Bryk and S. W. Raudenbush. (SLD)
Descriptors: Data Analysis, Estimation (Mathematics), Longitudinal Studies
Peer reviewed
Wainer, Howard – Journal of Educational and Behavioral Statistics, 1997
Four guidelines that make tables more effective data displays are presented. The need for these guidelines and their application are illustrated with data from the National Assessment of Educational Progress (NAEP). A theoretical structure is presented to help develop test items to assess students' proficiency in extracting information from…
Descriptors: Comprehension, Data Interpretation, Elementary Secondary Education, Information Dissemination