Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 13 |
| Since 2007 (last 20 years) | 29 |
Descriptor
| Correlation | 40 |
| Evaluation Methods | 40 |
| Sample Size | 40 |
| Factor Analysis | 13 |
| Monte Carlo Methods | 13 |
| Simulation | 12 |
| Statistical Analysis | 10 |
| Error of Measurement | 9 |
| Item Response Theory | 8 |
| Computation | 7 |
| Comparative Analysis | 6 |
Author
| Porter, Kristin E. | 3 |
| Ahn, Soyeon | 1 |
| An, Min | 1 |
| Arvey, Richard D. | 1 |
| Avsec, Stanislav | 1 |
| Ballard, Laura D. | 1 |
| Beavers, Daniel P. | 1 |
| Ben Domingue | 1 |
| Ben Stenhaug | 1 |
| Beretvas, S. Natasha | 1 |
| Chan, Daniel W.-L. | 1 |
Publication Type
| Journal Articles | 32 |
| Reports - Research | 30 |
| Reports - Evaluative | 4 |
| Dissertations/Theses -… | 3 |
| Guides - Non-Classroom | 3 |
| Speeches/Meeting Papers | 1 |
Education Level
| Elementary Education | 6 |
| Intermediate Grades | 3 |
| Middle Schools | 3 |
| Secondary Education | 3 |
| Elementary Secondary Education | 2 |
| Grade 6 | 2 |
| Grade 5 | 1 |
| Grade 8 | 1 |
| High Schools | 1 |
| Higher Education | 1 |
| Junior High Schools | 1 |
Audience
| Researchers | 3 |
Assessments and Surveys
| National Assessment of… | 1 |
| Program for International… | 1 |
| Progress in International… | 1 |
| Trends in International… | 1 |
Yan Xia; Xinchang Zhou – Educational and Psychological Measurement, 2025
Parallel analysis has been considered one of the most accurate methods for determining the number of factors in factor analysis. One major advantage of parallel analysis over traditional factor retention methods (e.g., Kaiser's rule) is that it addresses the sampling variability of eigenvalues obtained from the identity matrix, representing the…
Descriptors: Factor Analysis, Statistical Analysis, Evaluation Methods, Sampling
Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023
A Monte Carlo simulation study was conducted to examine the performance of α, λ2, λ_4, λ_2, ω_T, GLB_MRFA, and GLB_Algebraic coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…
Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation
Paul A. Jewsbury; Matthew S. Johnson – Large-scale Assessments in Education, 2025
The standard methodology for many large-scale assessments in education involves regressing latent variables on numerous contextual variables to estimate proficiency distributions. To reduce the number of contextual variables used in the regression and improve estimation, we propose and evaluate principal component analysis on the covariance matrix…
Descriptors: Factor Analysis, Matrices, Regression (Statistics), Educational Assessment
Ben Stenhaug; Ben Domingue – Grantee Submission, 2022
The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…
Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
Li, Xinru; Dusseldorp, Elise; Meulman, Jacqueline J. – Research Synthesis Methods, 2019
In meta-analytic studies, there are often multiple moderators available (e.g., study characteristics). In such cases, traditional meta-analysis methods often lack sufficient power to investigate interaction effects between moderators, especially high-order interactions. To overcome this problem, meta-CART was proposed: an approach that applies…
Descriptors: Correlation, Meta Analysis, Identification, Testing
Kogar, Hakan – Journal of Education and Learning, 2018
The aim of the present research study was to compare the findings from the nonparametric MSA, DIMTEST and DETECT and the parametric dimensionality determining methods in various simulation conditions by utilizing exploratory and confirmatory methods. For this purpose, various simulation conditions were established based on number of dimensions,…
Descriptors: Evaluation Methods, Nonparametric Statistics, Statistical Analysis, Factor Analysis
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019
Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…
Descriptors: Value Added Models, Scores, Sample Size, Correlation
Porter, Kristin E. – Journal of Research on Educational Effectiveness, 2018
Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs) are statistical…
Descriptors: Statistical Analysis, Program Effectiveness, Intervention, Hypothesis Testing
Peter, Johannes; Rosman, Tom; Mayer, Anne-Kathrin; Leichner, Nikolas; Krampen, Günter – British Journal of Educational Psychology, 2016
Background: Particularly in higher education, not only a view of science as a means of finding absolute truths (absolutism), but also a view of science as generally tentative (multiplicism) can be unsophisticated and obstructive for learning. Most quantitative epistemic belief inventories neglect this and understand epistemic sophistication as…
Descriptors: Beliefs, Epistemology, Psychology, Factor Analysis
Porter, Kristin E. – Grantee Submission, 2017
Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs) are statistical…
Descriptors: Statistical Analysis, Program Effectiveness, Intervention, Hypothesis Testing
Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min – Practical Assessment, Research & Evaluation, 2016
This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…
Descriptors: Visualization, Item Response Theory, Sample Size, Correlation
Yoo, Hanwook; Wolf, Mikyung Kim; Ballard, Laura D. – Practical Assessment, Research & Evaluation, 2023
As the theme of the 2022 annual meeting of the American Educational Research Association, cultivating equitable education systems has gained renewed attention amid an increasingly diverse society. However, systemic inequalities persist for traditionally underserved student populations. As a way to better address diverse students' needs, it is of…
Descriptors: Comparative Analysis, Native Language, English Language Learners, Multilingualism
Porter, Kristin E. – MDRC, 2016
In education research and in many other fields, researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple…
Descriptors: Statistical Analysis, Program Effectiveness, Intervention, Hypothesis Testing

