Statistical and Mathematical Learning: an application to fraud detection and prevention

dc.contributor.advisorBaxter, Jeremy
dc.contributor.advisorAtemkeng Teufack, Marcellin
dc.contributor.authorHamlomo, Sisipho
dc.date.accessioned2026-03-04T07:03:14Z
dc.date.issued6/4/2022
dc.description.abstractCredit card fraud is an ever-growing problem. There has been a rapid increase in the rate of fraudulent activities in recent years resulting in a considerable loss to several organizations, companies, and government agencies. Many researchers have focused on detecting fraudulent behaviours early using advanced machine learning techniques. However, credit card fraud detection is not a straightforward task since fraudulent behaviours usually differ for each attempt and the dataset is highly imbalanced, that is, the frequency of non-fraudulent cases outnumbers the frequency of fraudulent cases. In the case of the European credit card dataset, we have a ratio of approximately one fraudulent case to five hundred and seventy-eight non-fraudulent cases. Different methods were implemented to overcome this problem, namely random undersampling, one-sided sampling, SMOTE combined with Tomek links and parameter tuning. Predictive classifiers, namely logistic regression, decision trees, k-nearest neighbour, support vector machine and multilayer perceptrons, are applied to predict if a transaction is fraudulent or non-fraudulent. The model's performance is evaluated based on recall, precision, F1-score, the area under receiver operating characteristics curve, geometric mean and Matthew correlation coefficient. The results showed that the logistic regression classifier performed better than other classifiers except when the dataset was oversampled.
dc.description.degreeMaster's thesis
dc.format.extent161 pages
dc.format.mimetypeapplication/pdf
dc.identifier.otherhttp://hdl.handle.net/10962/233795
dc.identifier.urihttps://researchrepository.ru.ac.za/handle/123456789/4729
dc.languageEnglish
dc.publisherRhodes University, Faculty of Science, Department of Statistics
dc.rightsHamlomo, Sisipho
dc.subjectCredit card fraud
dc.subjectBootstrap (Statistics)
dc.subjectSupport vector machines
dc.subjectNeural networks (Computer science)
dc.subjectDecision trees
dc.subjectMachine learning
dc.subjectCross-validation
dc.subjectImbalanced data
dc.titleStatistical and Mathematical Learning: an application to fraud detection and prevention
dc.typeAcademic thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Statistical_and_Mathematical_Learning__an_applicat_vital_50128.pdf
Size:
1.63 MB
Format:
Adobe Portable Document Format