Study-Unit Description

Study-Unit Description


CODE CSA5028

 
TITLE Finite State Machinery and Computational Morphology

 
UM LEVEL 05 - Postgraduate Modular Diploma or Degree Course

 
MQF LEVEL Not Applicable

 
ECTS CREDITS 6

 
DEPARTMENT Intelligent Computer Systems

 
DESCRIPTION Linguistic Morphology studies the internal structure of words. The main issues that have to be considered are (a) how a word is segmented into its component parts, (b) which parts are common to different forms of the same word and (c) how the parts interact with each other to define the particular nature of a given wordform. Computational morphology attempts to shed light on these issues by building computational models. For the most part these models are based on Finite State Automata of different kinds.

The study-unit is divided into two parts. The first part tackles the linguistic background dealing with the core phenomena of inflection, derivation, compounding, and clitics. Different languages will be looked at, including Maltese. Some related issues such as the lexicon and POS tags will also be discussed.

The second part of the study-unit looks at some of the computational issues concerned with morphological analysis and generation. The use of various toolkits will be illustrated. In addition, some other issues will be looked including how machine learning techniques can be applied to the problem of acquiring morphology automatically.

There are two assignments associated with this study-unit. The first will be mainly descriptive, involving the description of the morphology of a particular language. The second will involve the creation of an analyzer / generator for the chosen language.

Use will be made of Xerox technologies such as xfst and lexc toolbox developed at Stanford by Lauri Karttunen, Ken Beesley. If there is time other technologies will be looked at such as Goldsmith's Linguistica. Approaches to the lexicon will also be discussed in particular with reference to morphological analysis and generation.

Textbooks:

• Spencer, Andrew and Zwicky, Arnold M. (Eds.) (1998). The handbook of morphology. Blackwell handbooks in linguistics. Oxford: Blackwell. ISBN 0-631-18544-5.
• Monachesi, P. (2005) The Verbal Complex in Romance. OUP. Ch. 3.
• Sproat, R.W. (1992). Morphology and Computation. MIT Press, Cambridge, MA.
• K.Beesley and L. Karttunen (2003), Finite State Morphology, Stanford:CLSI Publications

Articles:

• Lauri Karttunen, Kimmo Koskenniemi, Gertjan van Noord. Special issue: Finite State Methods in Natural Language Processing. Natural Language Engineering. Volume 9, Part 1, March 2003.

 
STUDY-UNIT TYPE Lecture and Tutorial

 
METHOD OF ASSESSMENT
Assessment Component/s Sept. Asst Session Weighting
Assignment Yes 100%

 
LECTURER/S Ray Fabri (Co-ord.)
Aarne Ranta

 

 
The University makes every effort to ensure that the published Courses Plans, Programmes of Study and Study-Unit information are complete and up-to-date at the time of publication. The University reserves the right to make changes in case errors are detected after publication.
The availability of optional units may be subject to timetabling constraints.
Units not attracting a sufficient number of registrations may be withdrawn without notice.
It should be noted that all the information in the description above applies to study-units available during the academic year 2023/4. It may be subject to change in subsequent years.

https://www.um.edu.mt/course/studyunit