Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/25721
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2018-01-12T15:00:17Z-
dc.date.available2018-01-12T15:00:17Z-
dc.date.issued2017-
dc.identifier.urihttps://www.um.edu.mt/library/oar//handle/123456789/25721-
dc.descriptionM.SC.ITen_GB
dc.description.abstractResearchers have advocated that a pre-processing step is needed before applying massive time series data to high computational applications such as data mining algorithms, even if this introduces a reduction in the quality and nature of the original time series data. During the last two decades various time series dimensionality reduction techniques have been proposed in the literature to serve as a pre-processing step; one of these is numerosity reduction. Numerosity reduction gives excellent response time on complex data mining algorithms when comparing the same process over the raw time series. However no study have been dedicated to compare these time series dimensionality reduction techniques in terms of their effectiveness of producing a good representation that when applied to various data mining algorithms produces accurate results. The study selected nine well known times series datasets, applied four reduction techniques with five levels of reductions, and five knowledge extraction techniques also well known for time series mining. For each permutation we applied post processing evaluation metrics and produced an average accuracy level when compared to the same time series and data mining procedure on the raw time series. It has been shown that the Piecewise Aggregate Approximation (PAA) is able to produce results with more than 70% accuracy when applied to a partitional, hierarchical and classification algorithm. Furthermore, a Symbolic Aggregate Approximation (SAX) representation with a larger alphabet size produces higher accurate results than a SAX representation with a smaller alphabet size. On the other hand, the Discrete Wavelet Transform (DWT) produces results of lower accuracy than both the PAA and SAX techniques. Results have indicated a change in results’ accuracy levels from clustering and classification algorithms to the motif and discord discovery algorithms. The former algorithms produced results of higher accuracy than the latter algorithms.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectData miningen_GB
dc.subjectAlgorithmsen_GB
dc.subjectDimension reduction (Statistics)en_GB
dc.titleMining massive time series data : with dimensionality reduction techniquesen_GB
dc.typemasterThesisen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.publisher.institutionUniversity of Maltaen_GB
dc.publisher.departmentFaculty of Information and Communication Technology. Department of Computer Information Systemsen_GB
dc.description.reviewedN/Aen_GB
dc.contributor.creatorBorg, Justin-
Appears in Collections:Dissertations - FacICT - 2017
Dissertations - FacICTCIS - 2017

Files in This Item:
File Description SizeFormat 
17MCIS001.pdf
  Restricted Access
2.3 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.