Please use this identifier to cite or link to this item:
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLayfield, Colin-
dc.contributor.authorIvanović, Dragan-
dc.contributor.authorAzzopardi, Joel-
dc.identifier.citationLayfield, C., Ivanović, D., & Azzopardi, J. (2017, September). Multi-Lingual LSA with Serbian and Croatian: An Investigative Case Study. Third International KEYSTONE Conference, Poland. 155-164.en_GB
dc.description.abstractOne of the challenges in information retrieval is attempting to search a corpus of documents that may contain multiple languages. This exploratory study expands upon earlier research employing Latent Semantic Analysis (so called Multi-Lingual Latent Semantic Indexing, or ML-LSI/LSA). We experiment using this approach, and a new one, in a multi-lingual context utilising two similar languages, namely Serbian and Croatian. Traditionally, with an LSA approach, a parallel corpus would be needed in order to train the system by combining identical documents in two languages into one document. We repeat that approach and also experiment with creating a semantic space using the parallel corpus on its own without merging the documents together to test the hypothesis that, with very similar languages, the merging of documents may not be required for good results.en_GB
dc.subjectInformation retrieval -- Case studies.en_GB
dc.subjectInformation storage and retrieval systemsen_GB
dc.subjectLatent semantic indexingen_GB
dc.subjectNatural language processing (Computer science)en_GB
dc.subjectSearch enginesen_GB
dc.subjectCroatian language -- Data processingen_GB
dc.subjectSerbian language -- Data processingen_GB
dc.titleMulti-lingual LSA with Serbian and Croatian : an investigative case studyen_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.bibliographicCitation.conferencenameInternational KEYSTONE Conferenceen_GB
dc.bibliographicCitation.conferenceplaceGdańsk, Poland. 11-12/Sep/2017en_GB
Appears in Collections:Scholarly Works - FacICTAI

Files in This Item:
File Description SizeFormat 
  Restricted Access
232.66 kBAdobe PDFView/Open Request a copy

Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.