<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>OAR@UM Collection:</title>
    <link>https://www.um.edu.mt/library/oar/handle/123456789/25578</link>
    <description />
    <pubDate>Thu, 07 May 2026 20:22:20 GMT</pubDate>
    <dc:date>2026-05-07T20:22:20Z</dc:date>
    <item>
      <title>Application and improvement of genetic algorithms and genetic programming towards the fight against spam and other internet malware</title>
      <link>https://www.um.edu.mt/library/oar/handle/123456789/73653</link>
      <description>Title: Application and improvement of genetic algorithms and genetic programming towards the fight against spam and other internet malware
Abstract: While spam is increasingly acknowledged as a very expensive problem on the Internet, &#xD;
spam filters attempt to detect spam from emails with the highest possible accuracy. &#xD;
Reverse Polish Notation (RPN) expressions are proposed as a means to combine a range &#xD;
of evaluated features from emails for the detection of spam. Theoretical arguments for &#xD;
the use of RPN expressions applied to spam detection are proposed, together with a new &#xD;
RPN Block Representation. Theoretical comparisons of RPN expressions with Naïve &#xD;
Bayes and Support Vector Machine are also given. It is shown that such RPN expressions &#xD;
are more expressive. A proof that email spam detection is NP-complete is given by &#xD;
mapping groups of email spams onto malware virus families. Seventy-two features, &#xD;
ranging from Subject-line, Header-based, Message Body-based, URL-based and &#xD;
stylistic, have been used to evolve RPN expressions using Linear Genetic Programming. &#xD;
New features and specifically the application of a group of URL features to spam &#xD;
detection are proposed (since many spams contain links to domains which are at times &#xD;
even malicious). These new features are shown to be useful for spam detection &#xD;
theoretically and in practice. Linear Genetic Programming is a subset of Genetic &#xD;
Programming where chromosomes are computer programs represented using imperative &#xD;
computer language instructions or machine code instead of trees made up of symbolic &#xD;
expressions. Such machine code can encode RPN expressions. The Linear Genetic &#xD;
Programming system is used to “learn” an RPN expression consisting of a combination &#xD;
of features which can be used to detect spam.&#xD;
A number of feature selection algorithms are used to identify which subsets of features &#xD;
are most relevant to classification. The feature selection techniques Minimum &#xD;
Redundancy Maximum Relevance method (this filter technique finds features which are &#xD;
mutually far from each other while still having high “correlation” to classification), the &#xD;
conventional Maximum Relevance method, Principle Component Analysis and using &#xD;
the entire feature set are investigated using the Linear Genetic Programming system &#xD;
applied to the SpamAssassin Spam Corpus, which is a standard ham and spam archive. &#xD;
Theoretical and practical comparisons with literature results and industry open source &#xD;
applications are given for the proposed system.&#xD;
A new Block Crossover operator which helps preserve and keep intact RPN Blocks as &#xD;
well as a new RPN First Subexpression algorithm are also proposed. This algorithm &#xD;
helps to maintain a double check for spam and ham and results in improvement of &#xD;
detection accuracy and other metrics. The system was very demanding computationally, &#xD;
taking a long time to run on a supercomputer. The proposed system achieved 99.17% &#xD;
accuracy and an F1-score of 0.9868, which compare very well with results given in the &#xD;
literature as well as with performance of industrial Bogofilter and SpamAssassin Spam &#xD;
Filters.
Description: PH.D.</description>
      <pubDate>Sun, 01 Jan 2017 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">https://www.um.edu.mt/library/oar/handle/123456789/73653</guid>
      <dc:date>2017-01-01T00:00:00Z</dc:date>
    </item>
    <item>
      <title>Genetic algorithm based metaheuristic optimisation of machine learning algorithm parameters</title>
      <link>https://www.um.edu.mt/library/oar/handle/123456789/27791</link>
      <description>Title: Genetic algorithm based metaheuristic optimisation of machine learning algorithm parameters
Abstract: through design and experimentation. The search for optimal or near optimal algorithm&#xD;
configuration parameters is one way to improve performance and presents a research&#xD;
area of significant interest not only to researchers in the field, but also to developers of&#xD;
new algorithms and data analysts in diverse scientific fields.&#xD;
This optimisation process involves a search of very often a large, possibly infinite,&#xD;
unique and irregular parameter search space which renders the search for optimality or&#xD;
near optimality difficult in feasible time. Metaheuristic algorithms with properties,&#xD;
such as exploration, exploitation, use of acquired knowledge gained and stochastic&#xD;
elements, render themselves them a suitable class of solutions for this problem.&#xD;
Various studies have explored different metaheuristic or meta-optimisation approaches&#xD;
in the search for ever better machine learner optimisers and more generally applicable&#xD;
ones.&#xD;
The aim of this study was thus to explore and validate the use of a Genetic&#xD;
Algorithm based approach, often selected in studies of specific optimisation problems,&#xD;
as a generally applicable solution to the Machine Learning Algorithm Parameter&#xD;
optimisation problem. The Simple Genetic Algorithm (SGA), in particular, was&#xD;
chosen as the focus for this study because of its clear metaheuristic qualities and its&#xD;
relatively simple and well known evolutionary architecture.&#xD;
The method used to measure the performance of the SGA as a general metaoptimiser&#xD;
was involved the carrying out of a series of experiments, in which the SGA&#xD;
and other meta-optimiser algorithms were applied to the optimisation of a select test&#xD;
base of machine learner algorithms and datasets, with different characteristics and&#xD;
under various experimental conditions. In a novel take over other studies, a close&#xD;
measured look was also taken at the efficiency and effectiveness of the SGA in the&#xD;
meta-optimisation process.&#xD;
This necessitated the design and setup of a significant number of different&#xD;
experiment runs in a set of phased studies. These experiments were run using multiple&#xD;
fold cross-validation at machine learner and meta-optimiser levels for statistical&#xD;
validity, involving around 200 million machine learner evaluations executed over a set&#xD;
of ten standard workstations. The duration of evaluations of individual configurations ranged from a few milliseconds to over one and a half hours. An automated system&#xD;
was designed and developed by the author to run the planned experimentation and&#xD;
gather valuable performance data for subsequent analysis and reporting.&#xD;
The results showed that there were consistent, though not statistically significant,&#xD;
indications that the SGA was on average a good, though not optimal optimiser of&#xD;
machine learning algorithm parameters over the selected test base. It was also found&#xD;
that without tuning, the SGA suffered from inefficiencies which reduced its overall&#xD;
effectiveness.&#xD;
Other secondary results and methodological insights obtained include:&#xD;
1. the visualisation and measurement of the different accuracy parameter space&#xD;
landscapes,&#xD;
2. the measurement of the overheads and inefficiencies of search,&#xD;
3. the effect of evaluation time on machine learner performance,&#xD;
4. the effect of dataset size and machine learner processing costs on metaoptimiser&#xD;
performance,&#xD;
5. the effect of meta-optimiser tuning on its performance,&#xD;
6. the value of low-cost pre-optimisation exploration of the optimisation problem,&#xD;
and&#xD;
7. the general applicability of the measures developed or adapted for this study.&#xD;
Two meta-optimiser algorithms were developed for comparative analysis. One was&#xD;
based on Iterated Local search and employed a sampling of the parameter space&#xD;
neighbourhood of the current candidate for the local search. The other was a hybrid&#xD;
SGA with refinement of the epoch's best candidate through local search using the&#xD;
above neighbourhood sampling.&#xD;
The contribution of this study lies in the sum of its outcomes and the potential it&#xD;
holds for further research opportunities.
Description: PH.D.IT</description>
      <pubDate>Sun, 01 Jan 2017 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">https://www.um.edu.mt/library/oar/handle/123456789/27791</guid>
      <dc:date>2017-01-01T00:00:00Z</dc:date>
    </item>
    <item>
      <title>Link prediction in social graph databases</title>
      <link>https://www.um.edu.mt/library/oar/handle/123456789/25817</link>
      <description>Title: Link prediction in social graph databases
Abstract: The graph structure is widely researched due to its significance in various areas. The&#xD;
topology of such graphs offers a basis of pattern and knowledge extraction by using&#xD;
various mining algorithms. Domains like social activity, continued to give rise to the&#xD;
popularity of graphs, especially in the field of predictive analysis. The dynamic nature&#xD;
of social graphs motivated various researchers to anticipate the evolution of these&#xD;
networks through time. Predicting the likelihood of social interactions formulating at a&#xD;
future time period is based on the Link Prediction Problem. Realizing these links, help&#xD;
to discover future and hidden activities which are very useful for different sectors. A&#xD;
selection of social activities and interactions are not only dynamic, but their strength and&#xD;
reach evolve over time too. Due to this, a number of studies suggest that considering&#xD;
time as an additional dimension improves the results obtained in link prediction. This&#xD;
study evaluates the effect of this consideration by comparing results obtained from static&#xD;
methods with those returned from temporal methods. A supervised binary classification&#xD;
technique is used on three different social datasets with features describing popular&#xD;
graph metrics representing the similarities and proximities between nodes. This study&#xD;
also proposes and implements a method to assign time-based weights which describe&#xD;
the activeness of the network nodes based on how recent their adjacent interactions are.&#xD;
Various performance measures such as accuracy, precision, and recall are used to aid&#xD;
with the comparative analysis of the results. The results of this study show that the&#xD;
consideration of time-based aspects helps improve the link predictions. The Katz metric&#xD;
yielded the best performance when compared to the other graph metrics. This result on&#xD;
one of the datasets managed to correctly classify seventeen additional links when the&#xD;
time-based method is used.
Description: M.SC.IT</description>
      <pubDate>Sun, 01 Jan 2017 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">https://www.um.edu.mt/library/oar/handle/123456789/25817</guid>
      <dc:date>2017-01-01T00:00:00Z</dc:date>
    </item>
    <item>
      <title>E-voting system using blockchain technology</title>
      <link>https://www.um.edu.mt/library/oar/handle/123456789/25816</link>
      <description>Title: E-voting system using blockchain technology
Abstract: In recent years, cryptocurrencies have revolutionised the financial world with the&#xD;
introduction of Bitcoin. The Bitcoin operates in a decentralised network, where&#xD;
everyone is an owner, and no central server exists. This decentralised network is called&#xD;
a blockchain, and since its conception has evolved beyond its primary use. The idea of&#xD;
decentralised applications gave birth to new and innovative ways of designing&#xD;
applications. This idea continued growing with the introduction of smart contracts on&#xD;
another the Ethereum blockchain, giving new possibilities to developers. This&#xD;
dissertation provides one such application of blockchain technology, which area is&#xD;
blockchain based electronic voting. This application is analysed in the context of Malta&#xD;
and is aimed at replacing the currently used paper-based vote casting system. The&#xD;
system is designed under the constraint that the users have to be physically present at&#xD;
the polling booths The voting system is finally evaluated in terms of security and&#xD;
efficiency.
Description: B.SC.IT(HONS)</description>
      <pubDate>Sun, 01 Jan 2017 00:00:00 GMT</pubDate>
      <guid isPermaLink="false">https://www.um.edu.mt/library/oar/handle/123456789/25816</guid>
      <dc:date>2017-01-01T00:00:00Z</dc:date>
    </item>
  </channel>
</rss>

