Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/80628
Title: Demystifying the Voynich manuscript using computational linguistic techniques
Authors: Farrugia, Kevin (2021)
Keywords: Voynich manuscript
Machine learning
Support vector machines
Issue Date: 2021
Citation: Farrugia, K. (2021). Demystifying the Voynich manuscript using computational linguistic techniques (Bachelor's dissertation).
Abstract: The Voynich manuscript is a medieval codex written in undeciphered text by unknown people. In the paper 'How Many Glyphs and How Many Scribes? Digital Paleography and the Voynich Manuscript' by Dr Lisa Fagin Davis (Davis, 2020a), it is proposed that the manuscript is written by five scribes, and a detailed classification is provided. The goal of this research is to provide further evidence to strengthen this proposed classification and to surface any possible misclassification. Towards this end, an experiment was conducted that takes this proposed classification as ground truth and puts it to the test. The data acquired is a transliteration of the manuscript written in the Extensible Voynich Alphabet. It is split into an equal number of pages per scribe; taking into consideration three of the five scribes due to scribes 4 and 5 having much less data. The experiment utilizes stylometric features, namely character bigrams and character trigrams, as features in four different machine learning classifiers; a Deep Neural Network, a Multinomial Naive Bayes classifier, Support Vector Machines and a Decision Tree classifier. This is done in a ten-fold cross-validation where models are trained and predict a scribe for each page. Separately, a k-means unsupervised clustering algorithm is implemented using the same features with k = 3. The results of cross-validation and clustering are compared with the proposal of Dr Fagin Davis.
Description: B.Sc. (Hons) HLT (Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/80628
Appears in Collections:Dissertations - InsLin - 2021

Files in This Item:
File Description SizeFormat 
Kevin Farrugia.pdf
  Restricted Access
10.04 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.