ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	12

Descriptor

Evaluation Methods	30
Hypothesis Testing	30
Statistical Significance	30
Statistical Analysis	10
Program Evaluation	7
Research Methodology	7
Effect Size	6
Statistical Inference	6
Comparative Analysis	5
Probability	5
Analysis of Variance	4
Foreign Countries	4
Measurement Techniques	4
Program Effectiveness	4
Replication (Evaluation)	4
Research Design	4
Control Groups	3
Educational Environment	3
Educational Research	3
Evaluation Problems	3
Experimental Groups	3
Experiments	3
Goodness of Fit	3
Literature Reviews	3
Misconceptions	3
More ▼

Source

Evaluation Review	3
Psychological Methods	3
PS: Political Science and…	2
African Higher Education…	1
Assessment & Evaluation in…	1
Educational Administration…	1
Educational Researcher	1
Educational and Psychological…	1
Empirical Education Inc.	1
European Journal of…	1
Evaluation and Program…	1
Information Research: An…	1
International Education…	1
Journal of Experimental…	1
Learning Disabilities: A…	1
Online Submission	1
Social Forces	1
Structural Equation Modeling	1
More ▼

Publication Type

Journal Articles	21
Reports - Research	13
Reports - Evaluative	8
Reports - Descriptive	6
Speeches/Meeting Papers	5
Information Analyses	2
Opinion Papers	2
Guides - Non-Classroom	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Postsecondary Education	3
Elementary Secondary Education	1
Grade 10	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Location

Lithuania	1
Nigeria	1
Saudi Arabia	1
United Kingdom	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Social Skills Rating System

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Document Level Assessment of Document Retrieval Systems in a Pairwise System Evaluation

Peer reviewed
PDF on ERIC

Download full text

Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017

Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…

Descriptors: Information Retrieval, Documentation, Scores, Information Systems

The Effect of Peer Assessment on the Evaluation Process of Students

Peer reviewed
PDF on ERIC

Download full text

Alzaid, Jawaher Mohammed – International Education Studies, 2017

This study aims at finding out the effect of peer assessment on the evaluation process of students. The hypothesis underlying this study is that assessment is an integral part of the learning process, which should play an important role in the educational model. The current study will emphasize the importance of using peer assessment as a tool to…

Descriptors: Foreign Countries, College Students, Peer Evaluation, Student Evaluation

Factorial Invariance in Multiple Populations: A Multiple Testing Procedure

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Millsap, Roger E. – Educational and Psychological Measurement, 2013

A multiple testing method for examining factorial invariance for latent constructs evaluated by multiple indicators in distinct populations is outlined. The procedure is based on the false discovery rate concept and multiple individual restriction tests and resolves general limitations of a popular factorial invariance testing approach. The…

Descriptors: Testing, Statistical Analysis, Factor Analysis, Statistical Significance

What if Best Practice Is Too Expensive? Feedback on Oral Presentations and Efficient Use of Resources

Peer reviewed

Direct link

Leger, Lawrence A.; Glass, Karligash; Katsiampa, Paraskevi; Liu, Shibo; Sirichand, Kavita – Assessment & Evaluation in Higher Education, 2017

We evaluate feedback methods for oral presentations used in training non-quantitative research skills (literature review and various associated tasks). Training is provided through a credit-bearing module taught to MSc students of banking, economics and finance in the UK. Monitoring oral presentations and providing "best practice"…

Descriptors: Foreign Countries, Graduate Students, Masters Programs, Feedback (Response)

Education of Social Skills among Senior High School Age Students in Physical Education Classes

Peer reviewed
PDF on ERIC

Download full text

Akelaitis, Arturas V.; Malinauskas, Romualdas K. – European Journal of Contemporary Education, 2016

Research aim was to reveal peculiarities of the education of social skills among senior high school age students in physical education classes. We hypothesized that after the end of the educational experiment the senior high school age students will have more developed social skills in physical education classes. Participants in the study were 51…

Descriptors: Foreign Countries, Interpersonal Competence, High School Seniors, Physical Education

In School Settings, Are All RCTs (Randomized Control Trials) Exploratory?

Direct link

Newman, Denis; Jaciw, Andrew P. – Empirical Education Inc., 2012

The motivation for this paper is the authors' recent work on several randomized control trials in which they found the primary result, which averaged across subgroups or sites, to be moderated by demographic or site characteristics. They are led to examine a distinction that the Institute of Education Sciences (IES) makes between "confirmatory"…

Descriptors: Educational Research, Research Methodology, Research Design, Classification

Killeen's (2005) "p[subscript rep]" Coefficient: Logical and Mathematical Problems

Peer reviewed

Direct link

Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010

In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…

Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance

Improving Students' Diagram Comprehension with Classroom Instruction

Peer reviewed

Direct link

Cromley, Jennifer G.; Perez, Tony C.; Fitzhugh, Shannon L.; Newcombe, Nora S.; Wills, Theodore W.; Tanaka, Jacqueline C. – Journal of Experimental Education, 2013

The authors tested whether students can be taught to better understand conventional representations in diagrams, photographs, and other visual representations in science textbooks. The authors developed a teacher-delivered, workbook-and-discussion-based classroom instructional method called Conventions of Diagrams (COD). The authors trained 1…

Descriptors: Visual Aids, Textbooks, Biology, Grade 10

Regarding "p[subscript rep]": Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Serlin, Ronald C. – Psychological Methods, 2010

The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…

Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques

Replication, "p[subscript rep]," and Confidence Intervals: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Cumming, Geoff – Psychological Methods, 2010

This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity

Alphas and Asterisks: The Development of Statistical Significance Testing Standards in Sociology

Peer reviewed

Direct link

Leahey, Erin – Social Forces, 2005

In this paper, I trace the development of statistical significance testing standards in sociology by analyzing data from articles published in two prestigious sociology journals between 1935 and 2000. I focus on the role of two key elements in the diffusion literature, contagion and rationality, as well as the role of institutional factors. I find…

Descriptors: Evaluation Methods, Hypothesis Testing, Sociology, Statistical Significance

Policy Implications of Using Significance Tests in Evaluation Research.

Peer reviewed

Schneider, Anne L.; Darcy, Robert E. – Evaluation Review, 1984

The normative implications of applying significance tests in evaluation research are examined. The authors conclude that evaluators often make normative decisions, based on the traditional .05 significance level in studies with small samples. Additional reporting of the magnitude of impact, the significance level, and the power of the test is…

Descriptors: Evaluation Methods, Hypothesis Testing, Research Methodology, Research Problems

Balkanization and Unification of Probabilistic Inferences

Download full text

Yu, Chong-Ho – Online Submission, 2005

Many research-related classes in social sciences present probability as a unified approach based upon mathematical axioms, but neglect the diversity of various probability theories and their associated philosophical assumptions. Although currently the dominant statistical and probabilistic approach is the Fisherian tradition, the use of Fisherian…

Descriptors: Probability, Inferences, Social Sciences, Statistical Significance

Treatment Implementation and Statistical Power: A Research Note.

Peer reviewed

Cook, Thomas J.; Poole, W. Kenneth – Evaluation Review, 1982

The assumption of equal treatment implementation is questioned. Through the reanalysis of data from a nutrition supplementation program evaluation, the power of the analysis of treatment effects is shown to increase when data on the level of treatment implementation is included. (Author/CM)

Descriptors: Evaluation Methods, Hypothesis Testing, Power (Statistics), Program Evaluation

A Binary Time-Series Analysis of Domestic Child Homicide: On Monitoring Critical, Rare Criteria of System Performance.

Peer reviewed

Maney, A. C.; Kedem, Benjamin – Evaluation Review, 1982

A novel solution to the statistical problems in an evaluation of rare events is described. The significance of variations in the number of child homicides is analyzed in a binary time series of "active" months for monitoring future incidence and related systemic events. (Author/CM)

Descriptors: Child Abuse, Crime, Evaluation Methods, Hypothesis Testing

Previous Page | Next Page »

Pages: 1 | 2

Ajuonuma, Juliet O.	1
Akelaitis, Arturas V.	1
Alzaid, Jawaher Mohammed	1
Anderson, Judith I.	1
Bernard, Michael E.	1
Byrd, Jimmy K.	1
Cook, Thomas J.	1
Cromley, Jennifer G.	1
Cumming, Geoff	1
Darcy, Robert E.	1
Dunivant, Noel	1
Eagles, Munroe	1
Estes, Gary D.	1
Fitzhugh, Shannon L.	1
Fraas, John W.	1
Gabriel, Stephanie	1
Glass, Karligash	1
Gullickson, Arlen R.	1
Hail, Michael	1
Hanes, John C.	1
Hau, Kit-Tai	1
Jaciw, Andrew P.	1
Jackman, Robert W.	1
Katsiampa, Paraskevi	1
More ▼