Publication Date
In 2025 | 2 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 42 |
Since 2016 (last 10 years) | 227 |
Since 2006 (last 20 years) | 492 |
Descriptor
Source
Author
Petscher, Yaacov | 5 |
Patterson, Brian F. | 4 |
Sinharay, Sandip | 4 |
Allen, Jeff | 3 |
Allensworth, Elaine M. | 3 |
Bonner, Sarah M. | 3 |
Chan, Wendy | 3 |
Choi, Kilchan | 3 |
Foorman, Barbara R. | 3 |
Hughes, Jan N. | 3 |
Marini, Jessica P. | 3 |
More ▼ |
Publication Type
Education Level
Location
Florida | 24 |
Texas | 17 |
Illinois | 13 |
California | 11 |
New York | 10 |
Ohio | 10 |
North Carolina | 9 |
Georgia | 8 |
Missouri | 8 |
Pennsylvania | 8 |
Massachusetts | 7 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 2 |
Meets WWC Standards with or without Reservations | 5 |
Does not meet standards | 5 |
San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022
The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…
Descriptors: Tests, Scores, Statistical Analysis, Models
Wendy Chan – Asia Pacific Education Review, 2024
As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…
Descriptors: Probability, Scores, Causal Models, Statistical Inference
Rajeeb Das; Erika Schmitt; Michael T. Stephenson – Journal of College Student Retention: Research, Theory & Practice, 2024
First-year seminars (FYS) comprise one of 11 researched interventions in postsecondary education known as High-Impact Practices, but few rigorous studies report significantly high impacts. This study examined a FYS employing propensity score matching to link cases and controls in a quasi-experimental design. One semester later cumulative grade…
Descriptors: College Freshmen, First Year Seminars, Scores, Probability
Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023
Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…
Descriptors: Generalizability Theory, Probability, Scores, Sampling
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Grant Clayton; Joseph Taylor – Sage Research Methods Cases, 2024
In the United States high school students may take college level courses that count for credit in both high school and college, often called dual enrollment. Students may take courses at their high school from certified instructors or at a local college. We investigate possible differences between these two groups of students using student level…
Descriptors: Probability, Scores, Computation, Dual Enrollment
Collier, Zachary K.; Leite, Walter L. – Journal of Experimental Education, 2022
Artificial neural networks (NN) can help researchers estimate propensity scores for quasi-experimental estimation of treatment effects because they can automatically detect complex interactions involving many covariates. However, NN is difficult to implement due to the complexity of choosing an algorithm for various treatment levels and monitoring…
Descriptors: Artificial Intelligence, Mentors, Beginning Teachers, Teacher Persistence
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Chan, Wendy – American Journal of Evaluation, 2022
Over the past ten years, propensity score methods have made an important contribution to improving generalizations from studies that do not select samples randomly from a population of inference. However, these methods require assumptions and recent work has considered the role of bounding approaches that provide a range of treatment impact…
Descriptors: Probability, Scores, Scoring, Generalization
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Rickel, Jessica; Szatrowski, Alisa; Rosemond, Charlie – Online Submission, 2023
Using Coarsened Exact Matching (CEM), this analysis examined the impact of Zearn Math across 7,116 matched pairs of students in 31 Louisiana parishes. Students who completed an average of 3+ Zearn Math lessons per week during the 2021-2022 school year were compared to similarly matched peers who completed less than 1 lesson per week. Findings…
Descriptors: Mathematics Instruction, Teaching Methods, Instructional Effectiveness, Program Implementation
Metsämuuronen, Jari – International Journal of Educational Methodology, 2021
Although Goodman-Kruskal gamma (G) is used relatively rarely it has promising potential as a coefficient of association in educational settings. Characteristics of G are studied in three sub-studies related to educational measurement settings. G appears to be unexpectedly appealing as an estimator of association between an item and a score because…
Descriptors: Educational Assessment, Measurement, Item Analysis, Correlation
Harmston, Matt – ACT, Inc., 2020
Out of over 1.9 million students in the 2018 ACT-tested graduating class, 44% took the full ACT® test at least twice in hopes of improving their scores. Camara and Allen (2017) support this practice, confirming that factors such as time in the classroom between test administrations are associated with increases in ACT Composite scores from the…
Descriptors: College Entrance Examinations, Repetition, Scores, Probability
Joanna L. Dickert; Jian Li – Research in Higher Education, 2024
As colleges and universities grapple with uncertainty around current and future enrollment as well as increasingly vocal questions about the value of postsecondary education, it is critically important that institutions ascertain and invest in the elements of campus learning and engagement that add value to the undergraduate experience. This study…
Descriptors: College Graduates, Student Participation, Educational Practices, Longitudinal Studies
Collier, Zachary K.; Zhang, Haobai; Liu, Liu – Practical Assessment, Research & Evaluation, 2022
Although educational research and evaluation generally occur in multilevel settings, many analyses ignore cluster effects. Neglecting the nature of data from educational settings, especially in non-randomized experiments, can result in biased estimates with long-term consequences. Our manuscript improves the availability and understanding of…
Descriptors: Artificial Intelligence, Probability, Scores, Educational Research