Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/22443
Title: | TTS pre-processing issues for mixed language support |
Authors: | Farrugia, Paulseph-John |
Keywords: | Text-to-speech software; Speech processing systems -- Computer programs; Code switching (Linguistics); Bilingualism -- Malta |
Issue Date: | 2004 |
Publisher: | University of Malta. Faculty of ICT |
Citation: | Farrugia, P. (2004). TTS pre-processing issues for mixed language support. 2nd Computer Science Annual Workshop (CSAW’04), Kalkara. 36-41. |
Abstract: | The design of an open-domain Text-to-Speech (TTS) system mandates a pre-processing module that is responsible for preparing the input text so that it can be handled in a standard fashion by other TTS components. There are several pre-processing issues that are common across input domains, such as number, date and acronym handling. Others may be specific to the type of input under examination. In designing a TTS system supporting Maltese for SMS messages in the local context, specific issues are also encountered. In terms of language issues, the practical use of Maltese is heavily characterised by code-switching into other languages, most commonly English. While the resulting language may not be considered ‘Maltese’ in the strictest sense of the language definition, it creates a state of affairs that cannot simply be ignored by a general-purpose system. In the SMS domain, various shorthand notations and a lack of phrase structure are encountered. This paper describes these specific issues in further detail and discusses techniques with which they may be addressed. |
URI: | https://www.um.edu.mt/library/oar/handle/123456789/22443 |
Appears in Collections: | Scholarly Works - FacICTCS |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Proceedings of CSAW’04 - A4.pdf | | 123.58 kB | Adobe PDF |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.