USP Electronic Research Repository

Using ensemble decision tree model to predict student dropout in computing science

Naseem, Mohammed and Chaudhary, Kaylash C. and Sharma, Bibhya N. and Lal, Aman (2020) Using ensemble decision tree model to predict student dropout in computing science. [Conference Proceedings]

[thumbnail of Using_Ensemble_Decision_Tree_Model_to_Predict_Student_Dropout_in_Computing_Science.pdf]
Preview
PDF - Submitted Version
Download (224kB) | Preview

Abstract

Science, Technology, Engineering and Mathematics (STEM) professionals play a key role in the development of an economy. STEM workers are critical thinkers as they contribute immensely by driving innovations. There is a high demand for professionals in the STEM fields but there is also a shortage of human resource in these areas. One way to reduce this problem is by identifying students who are at-risk of dropping out and then intervening with focused strategies that will ensure that these students remain in same the programme till graduation. Therefore, this research aims to use a data mining classification technique to identify students who are at-risk of dropping out from their Computing Science (CS) degree programmes. The Random Forest (RF) decision tree algorithm is used to learn patterns from historical data about first-year undergraduate CS students who are enrolled in a tertiary institute in the South Pacific. A number of factors are used which comprise of students demographic information, previous education background, financial information as well as data about students' academic interaction. Feature selection is performed to determine which factors have greater influence in students' decision in dropping out. Cross-validation techniques are used to ensure that the models are not over-fitted. Two models were built using a 5fold and 10-fold cross-validation and the results were compared using several measures of model performance. The results show that the factors corresponding to students' academic performance in a first-year programming course had the greatest impact student attrition in CS.

Item Type: Conference Proceedings
Subjects: Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Science, Technology and Environment (FSTE) > School of Computing, Information and Mathematical Sciences
Depositing User: Ms Shalni Sanjana
Date Deposited: 20 Apr 2021 01:20
Last Modified: 28 Mar 2022 01:14
URI: https://repository.usp.ac.fj/id/eprint/12763

Actions (login required)

View Item View Item