Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 9 |
| Since 2007 (last 20 years) | 68 |
Descriptor
| Sample Size | 91 |
| Statistics | 91 |
| Statistical Analysis | 36 |
| Probability | 23 |
| Sampling | 17 |
| Computation | 16 |
| Hypothesis Testing | 16 |
| Mathematics Instruction | 13 |
| Simulation | 13 |
| Statistical Inference | 13 |
| Data | 11 |
| More ▼ | |
Source
Author
| Yuan, Ke-Hai | 3 |
| Erford, Bradley T. | 2 |
| Huck, Schuyler W. | 2 |
| MacDonald, Justin A. | 2 |
| Trafimow, David | 2 |
| Watson, Jane | 2 |
| Allison, David B. | 1 |
| Ansley, Timothy N. | 1 |
| Anyieni, Abel G. | 1 |
| Areri, Damaris K. | 1 |
| Aspinwall, Leslie | 1 |
| More ▼ | |
Publication Type
Education Level
| Higher Education | 14 |
| Postsecondary Education | 7 |
| Secondary Education | 7 |
| High Schools | 5 |
| Middle Schools | 3 |
| Adult Education | 2 |
| Grade 4 | 2 |
| Grade 6 | 2 |
| Grade 8 | 2 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| More ▼ | |
Audience
| Teachers | 9 |
| Practitioners | 5 |
| Researchers | 2 |
| Media Staff | 1 |
| Students | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Advanced Placement… | 1 |
| Iowa Tests of Basic Skills | 1 |
What Works Clearinghouse Rating
Conrad Borchers – International Educational Data Mining Society, 2025
Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…
Descriptors: Algorithms, Bias, Statistics, Simulation
Christopher Martin Amissah – ProQuest LLC, 2024
Measurement of latent constructs is one of the most challenging tasks in psychological research. Unlike physical variables, latent constructs are not directly observable but are inferred through individuals' responses to a set of items often referred to as measurement instruments, tests, surveys, or assessments. For decades, exploratory factor…
Descriptors: Models, Psychological Studies, Replication (Evaluation), Factor Analysis
Shi, Jiandong; Luo, Dehui; Weng, Hong; Zeng, Xian-Tao; Lin, Lu; Chu, Haitao; Tong, Tiejun – Research Synthesis Methods, 2020
When reporting the results of clinical studies, some researchers may choose the five-number summary (including the sample median, the first and third quartiles, and the minimum and maximum values) rather than the sample mean and standard deviation (SD), particularly for skewed data. For these studies, when included in a meta-analysis, it is often…
Descriptors: Statistics, Computation, Sample Size, Mathematical Formulas
Jin, Kuan-Yu; Eckes, Thomas – Measurement: Interdisciplinary Research and Perspectives, 2022
Recent research on rater effects in performance assessments has increasingly focused on rater centrality, the tendency to assign scores clustering around the rating scale's middle categories. In the present paper, we adopted Jin and Wang's (2018) extended facets modeling approach and constructed a centrality continuum, ranging from raters…
Descriptors: Performance Based Assessment, Evaluators, Scoring, Sample Size
Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021
S-X[superscript 2] is a popular item fit index that is available in commercial software packages such as "flex"MIRT. However, no research has systematically examined the performance of S-X[superscript 2] for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…
Descriptors: Statistics, Goodness of Fit, Test Items, Models
Pérez-Ferreirós, Alexandra; Kalén, Anton; Gómez, Miguel-Ángel; Rey, Ezequiel – Research Quarterly for Exercise and Sport, 2019
In basketball, game-related statistics are the most common measure of performance. However, the literature assessing their reliability is scarce. Purpose: Analyze the number of games required to obtain a good relative and absolute reliability of teams' game-related statistics. Method: A total of 884 games from the 2015-2016 to 2017-2018 seasons of…
Descriptors: Team Sports, Statistics, Reliability, Foreign Countries
Trafimow, David; MacDonald, Justin A. – Educational and Psychological Measurement, 2017
Typically, in education and psychology research, the investigator collects data and subsequently performs descriptive and inferential statistics. For example, a researcher might compute group means and use the null hypothesis significance testing procedure to draw conclusions about the populations from which the groups were drawn. We propose an…
Descriptors: Statistical Inference, Statistics, Data Collection, Equations (Mathematics)
Lehrer, Richard – Statistics Education Research Journal, 2017
Grade 6 (modal age 11) students invented and revised models of the variability generated as each measured the perimeter of a table in their classroom. To construct models, students represented variability as a linear composite of true measure (signal) and multiple sources of random error. Students revised models by developing sampling…
Descriptors: Models, Statistics, Statistical Inference, Mathematics Instruction
Spencer, Neil H.; Lay, Margaret; Kevan de Lopez, Lindsey – International Journal of Social Research Methodology, 2017
When undertaking quantitative hypothesis testing, social researchers need to decide whether the data with which they are working is suitable for parametric analyses to be used. When considering the relevant assumptions they can examine graphs and summary statistics but the decision making process is subjective and must also take into account the…
Descriptors: Evaluation Methods, Decision Making, Hypothesis Testing, Social Science Research
Phillips, Gary W.; Jiang, Tao – Practical Assessment, Research & Evaluation, 2016
Power analysis is a fundamental prerequisite for conducting scientific research. Without power analysis the researcher has no way of knowing whether the sample size is large enough to detect the effect he or she is looking for. This paper demonstrates how psychometric factors such as measurement error and equating error affect the power of…
Descriptors: Error of Measurement, Statistical Analysis, Equated Scores, Sample Size
Corcoran, Mimi – Mathematics Teacher, 2016
Statistics is enjoying some well-deserved limelight across mathematics curricula of late. Some statistical concepts, however, are not especially intuitive, and students struggle to comprehend and apply them. As an AP Statistics teacher, the author appreciates the central limit theorem as a foundational concept that plays a crucial role in…
Descriptors: Statistics, Mathematics Instruction, Mathematical Concepts, Learning Activities
English, Lyn D.; Watson, Jane M. – Journal for Research in Mathematics Education, 2016
The authors analyzed the development of 4th-grade students' understanding of the transition from experimental relative frequencies of outcomes to theoretical probabilities with a focus on the foundational statistical concepts of variation and expectation. After observing the decreasing variation from the theoretical probability as the sample size…
Descriptors: Elementary School Mathematics, Elementary School Students, Mathematical Concepts, Concept Formation
Inzunsa Cazares, Santiago – North American Chapter of the International Group for the Psychology of Mathematics Education, 2016
This article presents the results of a qualitative research with a group of 15 university students of social sciences on informal inferential reasoning developed in a computer environment on concepts involved in the confidence intervals. The results indicate that students developed a correct reasoning about sampling variability and visualized…
Descriptors: Qualitative Research, College Students, Inferences, Logical Thinking
Noll, Jennifer; Sharma, Sashi – Journal of Statistics Education, 2014
The "law of large numbers" indicates that as sample size increases, sample statistics become less variable and more closely estimate their corresponding population parameters. Different research studies investigating how people consider sample size when evaluating the reliability of a sample statistic have found a wide range of…
Descriptors: Qualitative Research, Meta Analysis, Hospitals, Task Analysis
Fask, Alan; Englander, Fred; Wang, Zhaobo – Practical Assessment, Research & Evaluation, 2015
There has been a remarkable growth in distance learning courses in higher education. Despite indications that distance learning courses are more vulnerable to cheating behavior than traditional courses, there has been little research studying whether online exams facilitate a relatively greater level of cheating. This article examines this issue…
Descriptors: Distance Education, Introductory Courses, Statistics, Cheating

Peer reviewed
Direct link
