Statistical classification, an application to credit default

dc.contributor.advisorBaxter, Jeremy
dc.contributor.authorSikhakhane, Anele Gcina
dc.date.accessioned2026-03-02T13:48:05Z
dc.date.issued11/10/2024
dc.description.abstractStatistical learning has been used in both industry and academia to create credit scoring models. These models are used to predict who might default on their loan repayments, thus minimizing the risk financial institutions face. In this study six traditional and one more recent classifier, namely kNN, LDA, CART, RF, AdaBoost, XGBoost and SynBoost were used to predict who might default on their loans. The data set used in this study was imbalanced thus sampling and performance evaluation techniques were investigated and used to balance the class distribution and assess the classifiers performance. In addition to the standard variables and data set, new variables called synthetic variables and synthetic data sets were produced, investigated and used to predict who might default on their loans. This study found that the synthetic data set had strong predictive power and sampling methods negatively affected the classifiers performance. The best-performing classifier was XGBoost, with an AUC score of 0.7732.
dc.description.degreeMaster's thesis
dc.format.extent160 pages
dc.format.mimetypeapplication/pdf
dc.identifier.otherhttp://hdl.handle.net/10962/465069
dc.identifier.urihttps://researchrepository.ru.ac.za/handle/123456789/3225
dc.languageEnglish
dc.publisherRhodes University, Faculty of Science, Department of Statistics
dc.rightsSikhakhane, Anele Gcina
dc.subjectBinary classification
dc.subjectDefault (Finance)
dc.subjectCredit cards
dc.subjectCredit risk
dc.subjectMachine learning
dc.subjectVariables (Mathematics)
dc.titleStatistical classification, an application to credit default
dc.typeAcademic thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Statistical_classification__an_application_to_cred_vital_76570.pdf
Size:
1 MB
Format:
Adobe Portable Document Format