Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/63045
Title: Intelligent speech recognition data acquisition for Maltese
Authors: Padovani, Ian
Keywords: Speech processing systems
Automatic speech recognition
Crowdsourcing
Maltese language
Issue Date: 2020
Citation: Padovani, I. (2020). Intelligent speech recognition data acquisition for Maltese (Bachelor's dissertation).
Abstract: Automatic Speech Recognition is a difficult task for under-resourced languages such as Maltese, as large quantities of data are required to develop such systems. This dissertation seeks to address this issue by crowdsourcing speech recordings and devising ways of validating the collected data efficiently. Common Voice was used as the crowdsourcing platform, facilitating the collection of more than 11 hours of speech data since its launch for Maltese. For validation, phonological analysis was performed on the text prompts using a grapheme-to-phoneme tool. The results were then compared with the number of syllables and segments detected in the speech using syllable nucleus detection and unsupervised automatic phoneme segmentation. Syllable distance between recordings and prompts was seen to be an effective validation metric down to distances as small as a single syllable. Segment distance was effective for differences of a few syllables or more.
Description: B.Sc. (Hons) Human Language Tech.
URI: https://www.um.edu.mt/library/oar/handle/123456789/63045
Appears in Collections: Dissertations - InsLin - 2020
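
The validation idea described in the abstract can be illustrated with a minimal sketch. It assumes that "syllable distance" means the absolute difference between the syllable count expected from the prompt and the count detected in the audio; the dissertation may define it differently. The `Recording` type, the `flag_for_review` helper, and the `max_distance` threshold are hypothetical names introduced here for illustration, and the counts are assumed to come from upstream tools (grapheme-to-phoneme analysis and syllable nucleus detection) that are not shown.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Recording:
    clip_id: str             # hypothetical clip identifier
    prompt_syllables: int    # syllables expected from the text prompt
    detected_syllables: int  # syllable nuclei detected in the audio


def syllable_distance(rec: Recording) -> int:
    # Assumed definition: absolute difference between the two counts.
    return abs(rec.prompt_syllables - rec.detected_syllables)


def flag_for_review(recordings: Iterable[Recording], max_distance: int = 0) -> List[str]:
    # Return the ids of clips whose distance exceeds the allowed threshold.
    return [r.clip_id for r in recordings if syllable_distance(r) > max_distance]


clips = [
    Recording("a", prompt_syllables=10, detected_syllables=10),
    Recording("b", prompt_syllables=10, detected_syllables=9),
    Recording("c", prompt_syllables=10, detected_syllables=14),
]
flagged_strict = flag_for_review(clips)                   # any mismatch is flagged
flagged_lenient = flag_for_review(clips, max_distance=1)  # tolerate one syllable
```

A stricter threshold catches single-syllable mismatches at the cost of more clips sent for manual review; the appropriate value would depend on how reliable the syllable detector is.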

Files in This Item:
File: 20BSCHLT003.pdf (Restricted Access)
Size: 2.22 MB
Format: Adobe PDF


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.