NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 1 to 15 of 72 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Umut Atasever; John Jerrim; Sabine Tieck – Educational Assessment, Evaluation and Accountability, 2024
Cross-national comparisons of educational achievement rely upon each participating country collecting nationally representative data. While obtaining high response rates is a key part of reaching this goal, other potentially important factors may also be at play. This paper focuses on one such issue--exclusion rates--which has received relatively…
Descriptors: International Assessment, Comparative Analysis, Cross Cultural Studies, Research Problems
Peer reviewed Peer reviewed
Direct linkDirect link
Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023
Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…
Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems
Egamaria Alacam; Craig K. Enders; Han Du; Brian T. Keller – Grantee Submission, 2023
Composite scores are an exceptionally important psychometric tool for behavioral science research applications. A prototypical example occurs with self-report data, where researchers routinely use questionnaires with multiple items that tap into different features of a target construct. Item-level missing data are endemic to composite score…
Descriptors: Regression (Statistics), Scores, Psychometrics, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Qianying; Liao, Jing; Lapata, Mirella; Macleod, Malcolm – Research Synthesis Methods, 2022
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing…
Descriptors: Risk, Natural Language Processing, Medical Research, Networks
Peer reviewed Peer reviewed
Direct linkDirect link
Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020
We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…
Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
McIntosh, James – European Education, 2019
This article examines whether the way that PISA models item outcomes in mathematics affects the validity of its country rankings. As an alternative to PISA methodology, a two-parameter logistic model is applied to PISA mathematics item data from Italy and Spain for the year 2009. In the estimation procedure, item difficulty and dispersion…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Bornmann, Lutz – Higher Education: The International Journal of Higher Education Research, 2017
Impact of science is one of the most important topics in scientometrics. Recent developments show a fundamental change in impact measurements from impact on science to impact on society. Since impact measurement is currently in a state of far reaching changes, this paper describes recent developments and facing problems in this area. For that, the…
Descriptors: Research Methodology, Bibliometrics, Science and Society, Measurement
Moore, Joann; Huang, Chi-Yu; Huh, Noo-Ree; Li, Tianli; Camara, Wayne – ACT, Inc., 2018
In the fall of 2017, ACT began providing a limited number of supports (also known as accommodations) to English Learner (EL) students in the US taking the ACT® test. The goal of the supports is to remove construct-irrelevant variance in students' scores related to limited English proficiency and allow students to more accurately demonstrate their…
Descriptors: English Language Learners, Testing Accommodations, College Entrance Examinations, Evaluation Research
Peer reviewed Peer reviewed
Direct linkDirect link
Lane, David; Oswald, Frederick L. – Educational Measurement: Issues and Practice, 2016
The educational literature, the popular press, and educated laypeople have all echoed a conclusion from the book "Academically Adrift" by Richard Arum and Josipa Roksa (which has now become received wisdom), namely, that 45% of college students showed no significant gains in critical thinking skills. Similar results were reported by…
Descriptors: College Students, Critical Thinking, Thinking Skills, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Brown, Gavin T. L.; Andrade, Heidi L.; Chen, Fei – Assessment in Education: Principles, Policy & Practice, 2015
Student self-assessment is a central component of current conceptions of formative and classroom assessment. The research on self-assessment has focused on its efficacy in promoting both academic achievement and self-regulated learning, with little concern for issues of validity. Because reliability of testing is considered a sine qua non for the…
Descriptors: Accuracy, Self Evaluation (Individuals), Students, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Piccone, Jason E. – Journal of Correctional Education, 2015
The effective evaluation of correctional programs is critically important. However, research in corrections rarely allows for the randomization of offenders to conditions of the study. This limitation compromises internal validity, and thus, causal conclusions can rarely be drawn. Increasingly, researchers are employing propensity score matching…
Descriptors: Correctional Education, Program Evaluation, Probability, Scores
Cecile C. Dietrich; Eric J. Lichtenberger – Sage Research Methods Cases, 2016
We present a case study of the process through which a methodology was developed and applied to a quasi-experimental research study that employed propensity score matching. Methodological decisions are discussed and summarized, including an explanation of the approaches selected for each step in the study as well as rationales for these…
Descriptors: Test Construction, Quasiexperimental Design, Community Colleges, Fees
Custer, Michael – Online Submission, 2015
This study examines the relationship between sample size and item parameter estimation precision when utilizing the one-parameter model. Item parameter estimates are examined relative to "true" values by evaluating the decline in root mean squared deviation (RMSD) and the number of outliers as sample size increases. This occurs across…
Descriptors: Sample Size, Item Response Theory, Computation, Accuracy
Peer reviewed Peer reviewed
Direct linkDirect link
Andersson, C.; Antelius, J.; Månsson, J.; Sund, K. – Scandinavian Journal of Educational Research, 2017
This study investigates technical efficiency and productivity for Swedish higher education institutions (HEIs). One identified problem in previous research concerns adjusting efficiency scores for input quality. This problem is avoided using grades from upper-secondary schools. A second problem concerns heterogeneity with respect to subjects and…
Descriptors: Efficiency, Productivity, Higher Education, Statistical Analysis
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5