Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/63045
Full metadata record
DC FieldValueLanguage
dc.date.accessioned2020-11-02T07:46:57Z-
dc.date.available2020-11-02T07:46:57Z-
dc.date.issued2020-
dc.identifier.citationPadovani, I. (2020). Intelligent speech recognition data acquisition for Maltese (Bachelor's dissertation).en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar/handle/123456789/63045-
dc.descriptionB.SC.(HONS)HUMAN LANGUAGE TECH.en_GB
dc.description.abstractAutomatic Speech Recognition is a difficult task for under-resourced languages such as Maltese, as large quantities of data are required for its development. This dissertation seeks to provide a solution to this issue by crowdsourcing speech recordings and devising ways of validating this data efficiently. Common Voice was used as a crowdsourcing platform, facilitating the collection of 11+hours of speech data since its launch for Maltese. For validation, phonological analysis was performed on the text prompts using a grapheme-to-phoneme tool. The results of this were then compared to the number of syllables and segments detected in the speech using syllable nucleus detection and unsupervised automatic phoneme segmentation. Syllable distance between recordings and prompts was seen to be an effective metric for validation down to distances as small as a single syllable. Segment distance was effective when faced with differences of a few syllables or more.en_GB
dc.language.isoenen_GB
dc.rightsinfo:eu-repo/semantics/restrictedAccessen_GB
dc.subjectSpeech processing systemsen_GB
dc.subjectAutomatic speech recognitionen_GB
dc.subjectCrowdsourcingen_GB
dc.subjectMaltese languageen_GB
dc.titleIntelligent speech recognition data acquisition for Malteseen_GB
dc.typebachelorThesisen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.publisher.institutionUniversity of Maltaen_GB
dc.publisher.departmentInstitute of Linguistics and Language Technologyen_GB
dc.description.reviewedN/Aen_GB
dc.contributor.creatorPadovani, Ian-
Appears in Collections:Dissertations - InsLin - 2020

Files in This Item:
File Description SizeFormat 
20BSCHLT003.pdf
  Restricted Access
2.22 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.