ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	28
Since 2016 (last 10 years)	138
Since 2006 (last 20 years)	304

Descriptor

Probability	324
Scores	324
Statistical Analysis	79
Foreign Countries	68
Regression (Statistics)	61
Academic Achievement	57
Correlation	55
Comparative Analysis	53
Item Response Theory	42
Gender Differences	40
Models	39
College Students	38
Longitudinal Studies	34
Predictor Variables	30
Test Items	27
Outcomes of Education	26
College Entrance Examinations	25
Student Characteristics	24
Undergraduate Students	24
High School Students	23
Achievement Tests	22
Computation	22
Secondary School Students	22
Mathematics Tests	21
Bayesian Statistics	20
More ▼

Publication Type

Journal Articles	324
Reports - Research	256
Reports - Evaluative	41
Reports - Descriptive	22
Tests/Questionnaires	9
Information Analyses	5
Opinion Papers	3
Guides - Non-Classroom	1

Education Level

Higher Education	97
Postsecondary Education	66
Secondary Education	59
High Schools	32
Elementary Education	30
Middle Schools	22
Early Childhood Education	18
Junior High Schools	18
Primary Education	12
Grade 8	10
Grade 3	9
Elementary Secondary Education	8
Two Year Colleges	7
Grade 7	6
Intermediate Grades	6
Kindergarten	6
Grade 4	5
Grade 6	5
Grade 9	5
Preschool Education	5
Grade 1	3
Grade 12	3
Grade 5	3
Adult Education	2
Grade 10	2
More ▼

Audience

Researchers	3
Teachers	2

Location

Texas	8
Florida	7
United Kingdom (England)	6
Turkey	5
Germany	4
Massachusetts	4
New Jersey	4
New York	4
California	3
Canada	3
Finland	3
Netherlands	3
North Carolina	3
Tennessee	3
United States	3
West Virginia	3
China	2
Georgia	2
Indiana	2
Italy	2
Mexico	2
Minnesota	2
Nigeria	2
Ohio	2
Oklahoma	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Pell Grant Program	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1 to 15 of 324 results Save | Export

A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems

Peer reviewed

Direct link

San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022

The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…

Descriptors: Tests, Scores, Statistical Analysis, Models

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Direct link

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

A Quasiexperimental Analysis of First-Year Seminar Outcomes at a Large University

Peer reviewed

Direct link

Rajeeb Das; Erika Schmitt; Michael T. Stephenson – Journal of College Student Retention: Research, Theory & Practice, 2024

First-year seminars (FYS) comprise one of 11 researched interventions in postsecondary education known as High-Impact Practices, but few rigorous studies report significantly high impacts. This study examined a FYS employing propensity score matching to link cases and controls in a quasi-experimental design. One semester later cumulative grade…

Descriptors: College Freshmen, First Year Seminars, Scores, Probability

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

A Tutorial on Artificial Neural Networks in Propensity Score Analysis

Peer reviewed

Direct link

Collier, Zachary K.; Leite, Walter L. – Journal of Experimental Education, 2022

Artificial neural networks (NN) can help researchers estimate propensity scores for quasi-experimental estimation of treatment effects because they can automatically detect complex interactions involving many covariates. However, NN is difficult to implement due to the complexity of choosing an algorithm for various treatment levels and monitoring…

Descriptors: Artificial Intelligence, Mentors, Beginning Teachers, Teacher Persistence

Explaining Performance Decline over the Course of Taking Comprehensive Proficiency Tests: The Roles of Effort and Omission Propensity

Peer reviewed

Direct link

Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024

In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…

Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries

The Role of Distributional Overlap on the Precision Gain of Bounds for Generalization

Peer reviewed

Direct link

Chan, Wendy – American Journal of Evaluation, 2022

Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…

Descriptors: Probability, Scores, Scoring, Generalization

How to Obtain the Most Error-Free Estimate of Reliability? Eight Sources of Deflation in the Estimates of Reliability to Avoid

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022

The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…

Descriptors: Test Reliability, Scores, Test Items, Correlation

Goodman-Kruskal Gamma and Dimension-Corrected Gamma in Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2021

Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…

Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation

Illuminating the Post-Graduation Impact of Undergraduate Participation in High-Impact Practices Using Propensity Score Analysis with Structural Equation Modeling

Peer reviewed

Direct link

Joanna L. Dickert; Jian Li – Research in Higher Education, 2024

As colleges and universities grapple with uncertainty around current and future enrollment as well as increasingly vocal questions about the value of postsecondary education, it is critically important that institutions ascertain and invest in the elements of campus learning and engagement that add value to the undergraduate experience. This study…

Descriptors: College Graduates, Student Participation, Educational Practices, Longitudinal Studies

Explained: Artificial Intelligence for Propensity Score Estimation in Multilevel Educational Settings

Peer reviewed
PDF on ERIC

Download full text

Collier, Zachary K.; Zhang, Haobai; Liu, Liu – Practical Assessment, Research & Evaluation, 2022

Although educational research and evaluation generally occur in multilevel settings, many analyses ignore cluster effects. Neglecting the nature of data from educational settings, especially in non-randomized experiments, can result in biased estimates with long-term consequences. Our manuscript improves the availability and understanding of…

Descriptors: Artificial Intelligence, Probability, Scores, Educational Research

When Should Individual Ability Estimates Be Reported if Rapid Guessing Is Present?

Peer reviewed

Direct link

Rios, Joseph A. – Applied Measurement in Education, 2022

Testing programs are confronted with the decision of whether to report individual scores for examinees that have engaged in rapid guessing (RG). As noted by the "Standards for Educational and Psychological Testing," this decision should be based on a documented criterion that determines score exclusion. To this end, a number of heuristic…

Descriptors: Testing, Guessing (Tests), Academic Ability, Scores

Investigating Constructed-Response Scoring over Time: The Effects of Study Design on Trend Rescore Statistics. Research Report. ETS RR-22-15

Peer reviewed
PDF on ERIC

Download full text

Donoghue, John R.; McClellan, Catherine A.; Hess, Melinda R. – ETS Research Report Series, 2022

When constructed-response items are administered for a second time, it is necessary to evaluate whether the current Time B administration's raters have drifted from the scoring of the original administration at Time A. To study this, Time A papers are sampled and rescored by Time B scorers. Commonly the scores are compared using the proportion of…

Descriptors: Item Response Theory, Test Construction, Scoring, Testing

Conditional Subscore Reporting Using Iterated Discrete Convolutions

Peer reviewed

Direct link

Feinberg, Richard A.; von Davier, Matthias – Journal of Educational and Behavioral Statistics, 2020

The literature showing that subscores fail to add value is vast; yet despite their typical redundancy and the frequent presence of substantial statistical errors, many stakeholders remain convinced of their necessity. This article describes a method for identifying and reporting unexpectedly high or low subscores by comparing each examinee's…

Descriptors: Scores, Probability, Statistical Distributions, Ability

The Winning Probability of a Game and the Importance of Points in Tennis Matches

Peer reviewed

Direct link

Sim, Min Kyu; Choi, Dong Gu – Research Quarterly for Exercise and Sport, 2020

Purpose: This study builds a stochastic model of a discrete-time Markov chain (DTMC) that fits well with a dataset of professional playing records. Methods: The point-by-point dataset of Men's single matches played in the Association of Tennis Professionals (ATP) tour from 2011 to 2015 is analyzed. A long-debated assumption on the…

Descriptors: Probability, Racquet Sports, Scores, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 22

Journal of Educational and…	12
Educational and Psychological…	11
International Journal of…	8
ETS Research Report Series	7
Applied Psychological…	6
Journal of College Student…	6
Journal of Educational…	6
Multivariate Behavioral…	6
Research in Higher Education	6
Journal of Educational…	5
Practical Assessment,…	5
American Educational Research…	4
Applied Measurement in…	4
Education Economics	4
Educational Evaluation and…	4
Journal of Experimental…	4
Measurement:…	4
Psychometrika	4
Child Abuse & Neglect: The…	3
Community College Review	3
Developmental Psychology	3
Economics of Education Review	3
Education Finance and Policy	3
Education and Information…	3
International Journal of…	3
More ▼

Chan, Wendy	3
Hughes, Jan N.	3
Sinharay, Sandip	3
Zumbo, Bruno D.	3
Afacan, Kemal	2
Attewell, Paul	2
Austin, Peter C.	2
Babo, Gerard	2
Beaujean, A. Alexander	2
Bonner, Sarah M.	2
Collier, Zachary K.	2
Coohey, Carol	2
Erdogan, Niyazi	2
Ferrando, Pere J.	2
Finch, W. Holmes	2
French, Brian F.	2
Haberman, Shelby J.	2
Justin, Whitney	2
Liu, Ou Lydia	2
Liu, Ren	2
Liu, Yan	2
Meijer, Rob R.	2
Metsämuuronen, Jari	2
Petscher, Yaacov	2
More ▼

SAT (College Admission Test)	16
ACT Assessment	5
Early Childhood Longitudinal…	4
Woodcock Johnson Tests of…	4
Child Behavior Checklist	3
National Longitudinal Survey…	3
Peabody Picture Vocabulary…	3
Program for International…	3
Test of English as a Foreign…	3
Wechsler Intelligence Scale…	3
Florida Comprehensive…	2
Graduate Record Examinations	2
National Assessment of…	2
Stanford Achievement Tests	2
Autism Diagnostic Observation…	1
Beck Depression Inventory	1
Beery Developmental Test of…	1
Beginning Postsecondary…	1
Behavior Assessment System…	1
Behavior Problem Checklist	1
Connecticut Mastery Testing…	1
Defining Issues Test	1
Diagnostic Interview Schedule…	1
Education Longitudinal Study…	1
Eyberg Child Behavior…	1
More ▼