<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <title>OAR@UM Collection:</title>
  <link rel="alternate" href="https://www.um.edu.mt/library/oar/handle/123456789/134219" />
  <subtitle />
  <id>https://www.um.edu.mt/library/oar/handle/123456789/134219</id>
  <updated>2026-06-03T11:54:30Z</updated>
  <dc:date>2026-06-03T11:54:30Z</dc:date>
  <entry>
    <title>Leveraging invariant prediction for mitigating specificity constraints in affect modelling</title>
    <link rel="alternate" href="https://www.um.edu.mt/library/oar/handle/123456789/146931" />
    <author>
      <name />
    </author>
    <id>https://www.um.edu.mt/library/oar/handle/123456789/146931</id>
    <updated>2026-05-29T08:00:32Z</updated>
    <published>2025-01-01T00:00:00Z</published>
    <summary type="text">Title: Leveraging invariant prediction for mitigating specificity constraints in affect modelling
Abstract: Affect modelling aims to predict human emotional states from multimodal signals,                                              yet current approaches often struggle to generalise beyond the specific datasets&#xD;
or contexts in which they are trained. This dissertation investigates the use of in&#xD;
variant features, predictors whose relationship with affective states remains stable&#xD;
across distinct environments, as a strategy to improve generalisability. To this end,&#xD;
two publicly available corpora, AGAIN and RECOLA, were systematically parti&#xD;
tioned into environments defined by user, task, and annotator triplets. An envi&#xD;
ronment refers to the conditions under which data is collected, and data gathered&#xD;
within the same environment is assumed to come from the same underlying distri&#xD;
bution. The Invariant Causal Prediction (ICP) framework was employed to identify&#xD;
stable features across these environments, which were then compared against full&#xD;
feature sets and principal components derived through PCA.&#xD;
Three supervised learning models—Logistic Regression, a feed-forward Neural&#xD;
Network,and a Long Short-Term Memory (LSTM)network — were trained under all&#xD;
three feature conditions, using group-based cross-validation to avoid information&#xD;
leakage. Results demonstrate that invariant features can deliver measurable benefits                          for feed-forward models, particularly in enhancing accuracy and correlation&#xD;
while substantially reducing feature dimensionality. However, their advantages&#xD;
were less consistent for sequence models like LSTMs, where temporal dependencies                       were not fully captured by invariants alone. Statistical significance tests further&#xD;
showed that invariant features improved balanced classification (F1) more strongly&#xD;
in AGAIN than in RECOLA,underscoring the dataset-specific nature of their effectiveness.&#xD;
Overall, the findings highlight both the promise and the limitations of invariance                                     in affect modelling. While not a universal solution, invariant features represent&#xD;
a principled means of isolating robust predictors across heterogeneous contexts,&#xD;
contributing to the broader goal of developing affective systems that are reliable,&#xD;
interpretable, and adaptable across diverse real-world settings.
Description: M.Sc.(Melit.)</summary>
    <dc:date>2025-01-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>Siamese network‐based vector embeddings of MRI scans for twin identification</title>
    <link rel="alternate" href="https://www.um.edu.mt/library/oar/handle/123456789/146930" />
    <author>
      <name />
    </author>
    <id>https://www.um.edu.mt/library/oar/handle/123456789/146930</id>
    <updated>2026-05-29T07:59:03Z</updated>
    <published>2025-01-01T00:00:00Z</published>
    <summary type="text">Title: Siamese network‐based vector embeddings of MRI scans for twin identification
Abstract: Monozygotic twins are identical twins that develop from a single fertilised egg that&#xD;
spontaneously splits, resulting in two individuals sharing 100% genetic material.&#xD;
Identifying monozygotic twins from brain MRI scans represents a frontier challenge in&#xD;
computational medical imaging with significant implications for understanding genetic&#xD;
influences on neuroanatomical structure through direct pattern recognition. While&#xD;
classical twin studies using ACE models decompose statistical variance to establish&#xD;
independent regional heritability estimates (60‐80%), this study introduces a&#xD;
fundamentally different computational framework that learns directly from MRI data to&#xD;
rank neuroanatomical regions by their collective discriminative capacity for genetic&#xD;
similarity detection, complementing traditional statistical approaches through&#xD;
data‐driven analysis.&#xD;
Adeep learning methodology employing Siamese networks with 3D CNN&#xD;
backbones is developed for automated twin identification using 138 genetically&#xD;
verified monozygotic twin pairs (276 subjects) from the Human Connectome Project&#xD;
S1200 dataset. Modified U‐Net, ResNet, and DenseNet architectures generate&#xD;
128‐dimensional embeddings optimised via triplet loss with hard negative mining,&#xD;
forcing models to learn subtle genetic signatures by focusing on challenging&#xD;
discriminative examples that distinguish twins from their most similar morphological&#xD;
matches.&#xD;
U‐Net achieved superior computational performance with 92.0% F1‐score&#xD;
(σ = 2.5%), 95.2% AUC‐ROC, and 91.4% accuracy, while ResNet demonstrated&#xD;
competitive results (89.6% F1‐score) and DenseNet showed greater variability (88.5%&#xD;
F1‐score). Embedding analysis reveals clear bimodal separation between genetically&#xD;
related and unrelated individuals through learned morphological patterns.&#xD;
Layer‐Wise Relevance Propagation analysis provides the first data‐driven&#xD;
ranking of neuroanatomical regions by discriminative importance for genetic&#xD;
relatedness detection. Statistical analysis reveals pronounced subcortical dominance&#xD;
with large effect size (Cohen’s d = 2.80, p = 3.89e‐6), with six subcortical structures&#xD;
occupying top positions, including the thalamus (0.955), brainstem (0.875), and&#xD;
hypothalamus (0.707). This computational hierarchy contrasts with traditional ACE&#xD;
studies reporting highest heritability in cortical areas (frontal 78‐95%, temporal&#xD;
77‐89%), demonstrating that direct pattern recognition from MRI data identifies&#xD;
different neuroanatomical signatures than statistical variance decomposition. Notably,&#xD;
models utilise practically all brain regions (most importance scores &gt; 0.2), indicating&#xD;
distributed multivariate processing rather than selective regional dependence.&#xD;
Ablation studies confirm data augmentation’s critical role, with substantial&#xD;
i&#xD;
performance improvements across CNN architectures. Clinical integration through&#xD;
standard neuroimaging formats in Connectome Workbench demonstrates immediate&#xD;
practical utility, positioning this computational approach for adoption in research and&#xD;
clinical environments requiring direct analysis of genetic influences in brain structure.&#xD;
The framework advances precision neuroimaging by providing automated,&#xD;
quantitative genetic similarity detection through direct pattern recognition, revealing&#xD;
spatial insights that complement traditional heritability studies while offering&#xD;
methodological advances applicable to diverse medical imaging classification tasks&#xD;
requiring regional discriminative analysis
Description: M.Sc.(Melit.)</summary>
    <dc:date>2025-01-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>AI‐Driven gesture recognition with smart gloves</title>
    <link rel="alternate" href="https://www.um.edu.mt/library/oar/handle/123456789/141991" />
    <author>
      <name />
    </author>
    <id>https://www.um.edu.mt/library/oar/handle/123456789/141991</id>
    <updated>2025-12-05T10:17:45Z</updated>
    <published>2025-01-01T00:00:00Z</published>
    <summary type="text">Title: AI‐Driven gesture recognition with smart gloves
Abstract: This research presents the development of an AI‐driven gesture recognition system&#xD;
aimed to enhance Human‐Computer Interaction through the use of smart gloves.&#xD;
Many emerging applications, such as virtual reality, robotics, and assistive technologies,&#xD;
require detailed motion capture of the hand in three dimensions. Traditional input&#xD;
devices are not designed to capture such motion, whereas wearable solutions like&#xD;
smart gloves offer a practical means of collecting complex motion data for gesture&#xD;
interpretation. This study proposes a system capable of interpreting dynamic hand&#xD;
gestures captured using smart gloves.&#xD;
A custom dataset was collected using Rokoko smart gloves, recording 14&#xD;
gesture classes from 14 subjects. Time‐series data captured from the smart gloves was&#xD;
preprocessed, and a range of feature extraction methods, including statistical,&#xD;
frequency‐domain, and motion‐based techniques, were applied. Experimental results&#xD;
were carried out to determine which features or combination of features gives the best&#xD;
result. Dimensionality reduction methods, namely Principal Component Analysis and&#xD;
Autoencoders, were examined to optimise the feature space and reduce complexity.&#xD;
A number of classification models were implemented and compared, including&#xD;
Support Vector Machines, K‐Nearest Neighbours, Hidden Markov Models, as well as,&#xD;
deep learning approaches such as CNN‐LSTM networks. Experimental results showed&#xD;
that while most models achieved high accuracy on validation data, up to 93.64%,&#xD;
performance significantly decreased when tested on data from unseen subjects,&#xD;
dropping to 20.39‐28.93%. This highlights the challenge of inter‐subject&#xD;
generalisation. To mitigate this, personalised models were implemented, showing good&#xD;
performance improvements. The SVM classifiers achieved accuracy results ranging&#xD;
from 67.9% to 92.9%, and the majority of precision, recall, and F1 scores exceeding&#xD;
85%, while CNN‐LSTM models achieved an accuracy above 95% consistently.&#xD;
Precision, recall, and F1‐score values also remained high.&#xD;
This work contributes to the field of gesture recognition by systematically&#xD;
evaluating feature engineering and modelling techniques on multichannel time‐series&#xD;
data. It underscores the importance of personalised learning strategies and provides&#xD;
insight into the practical limitations of real‐world deployment, such as latency and&#xD;
subject variability. Future work may explore domain adaptation, multimodal sensing,&#xD;
and real‐time implementation to further advance robust gesture‐based interfaces.
Description: M.Sc.(Melit.)</summary>
    <dc:date>2025-01-01T00:00:00Z</dc:date>
  </entry>
  <entry>
    <title>Table selection using information retrieval techniques for table-agnostic Text-to-SQL</title>
    <link rel="alternate" href="https://www.um.edu.mt/library/oar/handle/123456789/141983" />
    <author>
      <name />
    </author>
    <id>https://www.um.edu.mt/library/oar/handle/123456789/141983</id>
    <updated>2025-12-05T09:53:57Z</updated>
    <published>2025-01-01T00:00:00Z</published>
    <summary type="text">Title: Table selection using information retrieval techniques for table-agnostic Text-to-SQL
Abstract: Text-to-SQL has been effectively addressed using various NLP approaches,&#xD;
enabling the translation of natural language queries into SQL queries. A common prerequisite for these implementations, however, is the availability of the database table during inference. This requirement can pose challenges in scenarios where the table is not readily accessible to users. This work is motivated by the ongoing development of a chatbot tool within a private company, aimed&#xD;
at streamlining database interactions for the users. To address the table accessibility limitation, this study leverages Information Retrieval techniques to implement table selection based solely on the natural language query. We finetune pre-trained models like BERT and GIST-NoInstruct using the ColBERT method. We train our models using data we curate in-house by employing established LLM-prompting techniques. We prepare individual training datasets using two negative sampling techniques: uniform distribution and weighted probability distribution. We also experiment with various data fusion techniques such as RRF, CombMNZ, and Linear Combination to combine results from multiple search&#xD;
strategies. Our approach outperforms baseline methods in table retrieval, while also providing a comparative analysis of various retrieval strategies.
Description: M.Sc.(Melit.)</summary>
    <dc:date>2025-01-01T00:00:00Z</dc:date>
  </entry>
</feed>

