NotesFAQContact Us
Collection
Advanced
Search Tips
Back to results
Peer reviewed Peer reviewed
Direct linkDirect link
ERIC Number: EJ1400817
Record Type: Journal
Publication Date: 2023
Pages: 19
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1360-2357
EISSN: EISSN-1573-7608
Available Date: N/A
Student Performance Prediction Using Datamining Classification Algorithms: Evaluating Generalizability of Models from Geographical Aspect
Parhizkar, Amirmohammad; Tejeddin, Golnaz; Khatibi, Toktam
Education and Information Technologies, v28 n11 p14167-14185 2023
Increasing productivity in educational systems is of great importance. Researchers are keen to predict the academic performance of students; this is done to enhance the overall productivity of educational system by effectively identifying students whose performance is below average. This universal concern has been combined with data science leading to the creation of an interdisciplinary research area called Educational Data Mining. One of the recent issues which has been addressed by researchers is training generalizable models from different aspects such as gender, major, geography and etc. Therefore, in this research we use machine learning methods to predict student's performance, emphasizing on training generalizable models from geographical aspect. For this purpose, a questionnaire containing 37 questions was designed, through which 536 answers were collected, including 111 international and 425 domestic answers. According to the literature, student performance is mostly determined based on the GPA (grade point average) of the entire course. In this research, information about the GPA of respondents in undergraduate and graduate courses was collected in the form of three classes. After a final review of the models employed in previous studies, the main models selected and used for classification purposes included SVM, CNN, Adaboost, RF, SVM, and DT. Feature selection is performed using XGBoost, random forest, as well as SVML1. The main issue investigated in this study is the generalizability of the models trained on domestic (Iranian) data and tested on international data (non-Iranian). Experimental results show that the best models trained with specific dataset collected in this research had generalizability comparing to base models' outcomes which were trained and tested on domestic data. Meanwhile, Random forest and CNN models shows the best performance with the average of accuracy and F-score of 73.5 and 68.5, respectively.
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link-springer-com.bibliotheek.ehb.be/
Publication Type: Journal Articles; Reports - Research
Education Level: Higher Education; Postsecondary Education
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A