Title: Detection of outliers : a data mining approach
Authors: Fiott, Theresa (2015)
Keywords: Outliers (Statistics); Regression analysis; Data mining
Issue Date: 2015
Citation: Fiott, T. (2015). Detection of outliers : a data mining approach (Bachelor's dissertation).
Abstract: Data is constantly being generated by daily life. An outlier in a dataset is an observation or point that is considerably dissimilar to, or inconsistent with, the remainder of the data. Outliers could represent the most consequential elements in the data. Analogous rules exist elsewhere: in networks and software, a small percentage of root causes generates the bulk of failures. Finding this crucial information by sifting through the data is a highly sought-after exercise. An outlier can be an error, or it can be legitimate data, in which case it may inspire further research and enlighten researchers on an important principle or issue. Before removing outliers, researchers need to ask whether the data contains valuable information that might not relate to the intended study but is important in a broader sense. In this project, the Local Outlier Factor (LOF) algorithm is applied to a chosen dataset to compute an outlier score for each record. This involves various steps, which are outlined in the documentation. The aim of this project is to reinforce the notion that outliers should not be immediately discarded as noise or errors in the data. Instead, outliers are identified for their potential to represent new and previously unexplored relationships among the existing attributes in the dataset.
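For readers unfamiliar with LOF, the following is a minimal sketch of how an outlier score per record can be computed. It is not the dissertation's implementation. The function name `lof_scores`, the sample points, and the choice of `k` are illustrative assumptions, and the sketch simplifies the standard definition by using exactly `k` neighbours per point, with ties broken by record order.

```python
import math


def lof_scores(points, k):
    """Return a Local Outlier Factor for every point (simplified sketch).

    Assumption: each point uses exactly k nearest neighbours; ties in
    distance are broken by position in the input list.
    """
    n = len(points)
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < len(points)")

    # Pairwise Euclidean distances between all records.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Indices of the k nearest neighbours of each point, excluding itself.
    neighbours = [
        sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
        for i in range(n)
    ]

    # k-distance: distance from a point to its k-th nearest neighbour.
    k_dist = [dist[i][neighbours[i][-1]] for i in range(n)]

    # Local reachability density: inverse of the mean reachability
    # distance from a point to its neighbours.
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], dist[i][j]) for j in neighbours[i]]
        mean_reach = sum(reach) / k
        if mean_reach == 0:
            # Happens only when more than k records are exact duplicates.
            raise ValueError("too many duplicate points for this k")
        lrd.append(1.0 / mean_reach)

    # LOF: mean ratio of the neighbours' density to the point's own density.
    return [sum(lrd[j] for j in neighbours[i]) / (k * lrd[i]) for i in range(n)]


# Hypothetical data: four clustered records and one isolated record.
sample = [(0, 0), (0, 1), (1, 0), (1, 1), (5, 5)]
scores = lof_scores(sample, k=2)
```

Scores close to 1 indicate a record whose local density matches that of its neighbours. Scores well above 1 flag candidate outliers, which, in the spirit of the abstract, would be inspected before any decision to discard them.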
Description: B.Sc. IT (Hons)(Melit.)
Appears in Collections: Dissertations - FacICT - 2015

Files in This Item:
Access: Restricted Access
Size: 11.34 MB
Format: Adobe PDF

Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.