Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 12 |
Descriptor
| Evaluation Methods | 30 |
| Hypothesis Testing | 30 |
| Statistical Significance | 30 |
| Statistical Analysis | 10 |
| Program Evaluation | 7 |
| Research Methodology | 7 |
| Effect Size | 6 |
| Statistical Inference | 6 |
| Comparative Analysis | 5 |
| Probability | 5 |
| Analysis of Variance | 4 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 21 |
| Reports - Research | 13 |
| Reports - Evaluative | 8 |
| Reports - Descriptive | 6 |
| Speeches/Meeting Papers | 5 |
| Information Analyses | 2 |
| Opinion Papers | 2 |
| Guides - Non-Classroom | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
| Elementary Secondary Education | 1 |
| Grade 10 | 1 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| Grade 8 | 1 |
| Grade 9 | 1 |
| High Schools | 1 |
| More ▼ | |
Audience
Location
| Lithuania | 1 |
| Nigeria | 1 |
| Saudi Arabia | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
| Social Skills Rating System | 1 |
What Works Clearinghouse Rating
Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017
Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…
Descriptors: Information Retrieval, Documentation, Scores, Information Systems
Alzaid, Jawaher Mohammed – International Education Studies, 2017
This study aims at finding out the effect of peer assessment on the evaluation process of students. The hypothesis underlying this study is that assessment is an integral part of the learning process, which should play an important role in the educational model. The current study will emphasize the importance of using peer assessment as a tool to…
Descriptors: Foreign Countries, College Students, Peer Evaluation, Student Evaluation
Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013
A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…
Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance
Leger, Lawrence A.; Glass, Karligash; Katsiampa, Paraskevi; Liu, Shibo; Sirichand, Kavita – Assessment & Evaluation in Higher Education, 2017
We evaluate feedback methods for oral presentations used in training non-quantitative research skills (literature review and various associated tasks). Training is provided through a credit-bearing module taught to MSc students of banking, economics and finance in the UK. Monitoring oral presentations and providing "best practice"…
Descriptors: Foreign Countries, Graduate Students, Masters Programs, Feedback (Response)
Akelaitis, Arturas V.; Malinauskas, Romualdas K. – European Journal of Contemporary Education, 2016
Research aim was to reveal peculiarities of the education of social skills among senior high school age students in physical education classes. We hypothesized that after the end of the educational experiment the senior high school age students will have more developed social skills in physical education classes. Participants in the study were 51…
Descriptors: Foreign Countries, Interpersonal Competence, High School Seniors, Physical Education
Newman, Denis; Jaciw, Andrew P. – Empirical Education Inc., 2012
The motivation for this paper is the authors' recent work on several randomized control trials in which they found the primary result, which averaged across subgroups or sites, to be moderated by demographic or site characteristics. They are led to examine a distinction that the Institute of Education Sciences (IES) makes between "confirmatory"…
Descriptors: Educational Research, Research Methodology, Research Design, Classification
Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010
In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…
Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance
Cromley, Jennifer G.; Perez, Tony C.; Fitzhugh, Shannon L.; Newcombe, Nora S.; Wills, Theodore W.; Tanaka, Jacqueline C. – Journal of Experimental Education, 2013
The authors tested whether students can be taught to better understand conventional representations in diagrams, photographs, and other visual representations in science textbooks. The authors developed a teacher-delivered, workbook-and-discussion-based classroom instructional method called Conventions of Diagrams (COD). The authors trained 1…
Descriptors: Visual Aids, Textbooks, Biology, Grade 10
Serlin, Ronald C. – Psychological Methods, 2010
The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…
Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques
Cumming, Geoff – Psychological Methods, 2010
This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity
Leahey, Erin – Social Forces, 2005
In this paper, I trace the development of statistical significance testing standards in sociology by analyzing data from articles published in two prestigious sociology journals between 1935 and 2000. I focus on the role of two key elements in the diffusion literature, contagion and rationality, as well as the role of institutional factors. I find…
Descriptors: Evaluation Methods, Hypothesis Testing, Sociology, Statistical Significance
Peer reviewedSchneider, Anne L.; Darcy, Robert E. – Evaluation Review, 1984
The normative implications of applying significance tests in evaluation research are examined. The authors conclude that evaluators often make normative decisions, based on the traditional .05 significance level in studies with small samples. Additional reporting of the magnitude of impact, the significance level, and the power of the test is…
Descriptors: Evaluation Methods, Hypothesis Testing, Research Methodology, Research Problems
Yu, Chong-Ho – Online Submission, 2005
Many research-related classes in social sciences present probability as a unified approach based upon mathematical axioms, but neglect the diversity of various probability theories and their associated philosophical assumptions. Although currently the dominant statistical and probabilistic approach is the Fisherian tradition, the use of Fisherian…
Descriptors: Probability, Inferences, Social Sciences, Statistical Significance
Peer reviewedCook, Thomas J.; Poole, W. Kenneth – Evaluation Review, 1982
The assumption of equal treatment implementation is questioned. Through the reanalysis of data from a nutrition supplementation program evaluation, the power of the analysis of treatment effects is shown to increase when data on the level of treatment implementation is included. (Author/CM)
Descriptors: Evaluation Methods, Hypothesis Testing, Power (Statistics), Program Evaluation
Peer reviewedManey, A. C.; Kedem, Benjamin – Evaluation Review, 1982
A novel solution to the statistical problems in an evaluation of rare events is described. The significance of variations in the number of child homicides is analyzed in a binary time series of "active" months for monitoring future incidence and related systemic events. (Author/CM)
Descriptors: Child Abuse, Crime, Evaluation Methods, Hypothesis Testing
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
