Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/92160
Title: The development of a statistical spell checker for Maltese
Authors: Mizzi, Ruth (2000)
Keywords: Natural language generation (Computer science)
Statistics
Computational linguistics
Issue Date: 2000
Citation: Mizzi, R. (2000). The development of a statistical spell checker for Maltese (Bachelor's dissertation).
Abstract: The detection and correction of spelling errors is an integral part of most modern word-processors. This is a consequence of the large amount of misspelled words that one commonly finds in typed text. The reasons for this frequent occurrence of misspellings are varied but, among these, the one which carries most weight is the nature of Natural Languages which is extremely complex and full of exceptions. This project is concerned with the development of a statistical spell checker for the Maltese Language. The very rich morphology of this language poses the problem that traditional methods of Spelling Correction, such as dictionary lookup, are not ideal approaches to the problem at hand. In this work, the approach taken is based on statistical N-Gram spelling models. Various other statistical and probabilistic models, concerned with the problem of context-sensitive spelling correction, are also analysed. These additional techniques aim to provide a solution for the detection of real-word errors in a document, that is, the detection of those misspellings that result in an existing word in the language. The approach followed in these techniques is that of developing a statistical model from an existing training corpus and then using that model to interpret new, unseen data. Implementation involved the development of two separate tools. The first tool offers all the functions necessary for generating statistics while the second tool is the actual spell checker which makes use of these generated statistics in order to analyse the given text. The results obtained were promising and, when experiments were carried out using both the Maltese and English Language, it was concluded that there are no obvious disparities between the results obtained in both cases.
Description: B.Sc. IT (Hons)(Melit.)
URI: https://www.um.edu.mt/library/oar/handle/123456789/92160
Appears in Collections:Dissertations - FacICT - 1999-2009
Dissertations - FacICTCS - 1999-2007

Files in This Item:
File Description SizeFormat 
B.SC.(HONS)IT_Mizzi_Ruth_2000.PDF
  Restricted Access
6.51 MBAdobe PDFView/Open Request a copy
Mizzi_Ruth_acc.material.pdf
  Restricted Access
64.46 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.