Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/63155
Full metadata record
DC Field | Value | Language
--- | --- | ---
dc.date.accessioned | 2020-11-02T11:36:44Z | -
dc.date.available | 2020-11-02T11:36:44Z | -
dc.date.issued | 2020 | -
dc.identifier.citation | Bezzina, R. (2020). A review of imbalanced data techniques with application to loan default (Bachelor's dissertation). | en_GB
dc.identifier.uri | https://www.um.edu.mt/library/oar/handle/123456789/63155 | -
dc.description | B.SC.(HONS)STATS.&OP.RESEARCH | en_GB
dc.description.abstract | Customer loan default occurs when a borrower does not honour the loan repayment programme agreed with the bank. In terms of accounting standards and banking regulation, a loan is deemed to be in default when repayment of capital and/or interest falls in arrears by 90 days or more. In the case of personal loans, which are the basis of this thesis, a borrower can default due to loss of income caused by, for example, redundancy or the loss of income-generating assets. The lender's mitigation of the risk of loss depends on its ability to adequately evaluate credit risk. The lender's objective is to increase revenue by holding a loan portfolio containing predominantly well-performing loans at an interest rate reflecting the relative risk, while at the same time minimising losses resulting from defaulted loans. Tree-based methods are among the tools that can be used for risk mitigation purposes. However, one weakness of these methods is their sensitivity to imbalanced data. The focus of this dissertation is the application of techniques to overcome this issue and thus generate accurate predictions from these tree-based methods for classification. The theory of classification trees will also be discussed, with decision trees serving as the foundation for understanding bagged trees. Bagged trees will be applied to a real-life dataset after applying various techniques used to remove class imbalance within a dataset, namely SMOTE, Borderline SMOTE, ADASYN, Safe-Level SMOTE and SMOTE-NC, and the respective results will be compared. Logistic regression will also be applied, as it is a benchmark statistical model for classification. | en_GB
dc.language.iso | en | en_GB
dc.rights | info:eu-repo/semantics/restrictedAccess | en_GB
dc.subject | Loans | en_GB
dc.subject | Artificial intelligence | en_GB
dc.subject | Computer communication systems | en_GB
dc.title | A review of imbalanced data techniques with application to loan default | en_GB
dc.type | bachelorThesis | en_GB
dc.rights.holder | The copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder. | en_GB
dc.publisher.institution | University of Malta | en_GB
dc.publisher.department | Faculty of Science. Department of Statistics and Operations Research | en_GB
dc.description.reviewed | N/A | en_GB
dc.contributor.creator | Bezzina, Rachel | -
Appears in Collections: Dissertations - FacSci - 2020
Dissertations - FacSciSOR - 2020

Files in This Item:
File | Description | Size | Format
--- | --- | --- | ---
20BSCMSOR002.pdf (Restricted Access) | | 3.36 MB | Adobe PDF


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.