Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/119355
Title: Resources and tools for pre-processing speech data in a lesser-known variety of English
Other Titles: Proceedings of the 20th International Congress of Phonetic Sciences
Authors: Vella, Alexandra; Grech, Sarah; Padovani, Ian; Micallef, Maria-Christina
Keywords: Information resources; Phonetics; Grammar, Comparative and general -- Phonology; Malta -- Languages; English language -- Variation
Issue Date: 2023
Publisher: Guarant International
Citation: Vella, A., Grech, S., Padovani, I., & Micallef, M. C. (2023). Resources and tools for pre-processing speech data in a lesser-known variety of English. In R. Skarnitzl & J. Volín (Eds.), Proceedings of the 20th International Congress of Phonetic Sciences (pp. 3369–3373). Prague: Guarant International.
Abstract: Research on lesser-known language varieties can be hindered from the outset by the need for both data and tools for automating the required pre-processing work. For speech, whilst more ecologically valid data in the form of video and audio are sometimes available, these need to be accompanied by a machine-readable text, ideally segmented and labelled, both of which allow for searchability. A significant initial commitment is needed even before the relevant phonetic and phonological research can begin. This paper demonstrates and evaluates the efficacy of already existing tools (YouTube captioning and WebMAUS forced alignment) in automating the pre-processing work required, using Maltese English (MaltE), whilst also showcasing a sample analysis of the pronunciation of post-vocalic ‘r’ in the variety. As a low-resource variety of English, MaltE presents a test case for showing how existing resources and tools can be utilised to work with language varieties which are digitally less well-supported.
URI: https://www.um.edu.mt/library/oar/handle/123456789/119355
ISBN: 9788090811423
Appears in Collections: Scholarly Works - InsLin

Files in This Item:
File: Resources_and_tools_for_pre_processing_speech_data_in_a_lesser_known_variety_of_English.pdf
Size: 940.6 kB
Format: Adobe PDF


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.