Showing 586 to 600 of 3,316 results
Gulsah Gurkan – ProQuest LLC, 2021
Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…
Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries
Peer reviewed
Rollins, Derrick, Sr. – Chemical Engineering Education, 2017
Statistical inference simply means to draw a conclusion based on information that comes from data. Error bars are the most commonly used tool for data analysis and inference in chemical engineering data studies. This work demonstrates, using common types of data collection studies, the importance of specifying the statistical model for sound…
Descriptors: Data Analysis, Statistical Inference, Chemical Engineering, Models
Peer reviewed
Wang, Yan; Kim, Eun Sook; Nguyen, Diep Thi; Pham, Thanh Vinh; Chen, Yi-Hsin; Yi, Zhiyao – AERA Online Paper Repository, 2017
The analysis of variance (ANOVA) F test is a commonly used method to test the mean equality among two or more populations. A critical assumption of ANOVA is homogeneity of variance (HOV), that is, the compared groups have equal variances. Although it is encouraged to test HOV as part of the regular ANOVA procedure, the efficacy of the initial HOV…
Descriptors: Statistical Analysis, Error of Measurement, Robustness (Statistics), Sampling
Peer reviewed
Philipp, Michel; Strobl, Carolin; de la Torre, Jimmy; Zeileis, Achim – Journal of Educational and Behavioral Statistics, 2018
Cognitive diagnosis models (CDMs) are an increasingly popular method to assess mastery or nonmastery of a set of fine-grained abilities in educational or psychological assessments. Several inference techniques are available to quantify the uncertainty of model parameter estimates, to compare different versions of CDMs, or to check model…
Descriptors: Computation, Error of Measurement, Models, Cognitive Measurement
Zhang, Xue; Wang, Chun; Tao, Jian – Grantee Submission, 2018
Testing item-level fit is important in scale development to guide item revision/deletion. Many item-level fit indices have been proposed in literature, yet none of them were directly applicable to an important family of models, namely, the higher order item response theory (HO-IRT) models. In this study, chi-square-based fit indices (i.e., Yen's…
Descriptors: Item Response Theory, Models, Test Items, Goodness of Fit
Hyunsuk Han – ProQuest LLC, 2018
In Huggins-Manley & Han (2017), it was shown that WLSMV global model fit indices used in structural equation modeling practice are sensitive to person parameter estimate RMSE and item difficulty parameter estimate RMSE that result from local dependence in 2-PL IRT models, particularly when conditioning on number of test items and sample size.…
Descriptors: Models, Statistical Analysis, Item Response Theory, Evaluation Methods
Pei-Hsuan Chiu – ProQuest LLC, 2018
Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…
Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models
Peer reviewed
Jeffry White – Journal of Educational Research and Practice, 2024
Violations of normality and homogeneity are common in educational data. When this occurs, the use of parametric statistics may be inappropriate. A generalized form of nonparametric analyses based on the Puri and Sen L statistic provides an alternative approach. Using a chi-square distribution, this technique is easy to apply and has significant…
Descriptors: Nonparametric Statistics, Learning Analytics, Evaluation Methods, Guidance
Peer reviewed
Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…
Descriptors: Evaluators, Training, Comparative Analysis, Academic Language
Peer reviewed
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Peer reviewed
Stensen, Kenneth; Lydersen, Stian; Ranøyen, Ingunn; Klöckner, Christian A.; Buøen, Elisabet S.; Lekhal, Ratib; Drugli, May Britt – Journal of Psychoeducational Assessment, 2023
The Student-Teacher Relationship Scale-Short Form (STRS-SF) is one of the most frequently used instruments globally to measure professional caregivers' perceptions of the relationship quality with a specific child. However, its psychometric properties for children younger than 3 years of age enrolled in early childhood education and care (ECEC)…
Descriptors: Foreign Countries, Teacher Student Relationship, Teacher Attitudes, Psychometrics
Peer reviewed
Putman, S. Michael; Wang, Chuang; Rickelman, Bob; Crossley, Antony; Mittag, Waldemar – Education and Information Technologies, 2020
Competent use of the Internet to locate information is an important skill for today's youth. Yet, many lack the knowledge and dispositions to engage in the processes necessary to effectively and efficiently find information on the Internet. As a result, various countries have incorporated references to the processes of online inquiry within their…
Descriptors: Inquiry, Online Searching, International Assessment, Academic Standards
Peer reviewed
Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2020
This study presents new models for item response functions (IRFs) in the framework of the D-scoring method (DSM) that is gaining attention in the field of educational and psychological measurement and large-scale assessments. In a previous work on DSM, the IRFs of binary items were estimated using a logistic regression model (LRM). However, the LRM…
Descriptors: Item Response Theory, Scoring, True Scores, Scaling
Peer reviewed
Selvi, Hüseyin; Alici, Devrim; Uzun, Nezaket Bilge – Asian Journal of Education and Training, 2020
This study aims to comparatively examine the resultant findings by testing the measurement invariance with structural equation modeling in cases where the missing data is handled using the expectation-maximization (EM), regression imputation, and mean substitution methods in the complete data matrix and the 5% missing data matrix that is randomly…
Descriptors: Error of Measurement, Structural Equation Models, Attitude Measures, Student Attitudes
Peer reviewed
Gorgun, Guher; Bulut, Okan – Educational and Psychological Measurement, 2021
In low-stakes assessments, some students may not reach the end of the test and leave some items unanswered due to various reasons (e.g., lack of test-taking motivation, poor time management, and test speededness). Not-reached items are often treated as incorrect or not-administered in the scoring process. However, when the proportion of…
Descriptors: Scoring, Test Items, Response Style (Tests), Mathematics Tests