Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/16950
Title: Amilcare : adaptive information extraction for document annotation
Authors: Ciravegna, Fabio
Dingli, Alexiei
Wilks, Yorick
Petrelli, Daniela
Keywords: Knowledge management
Corpora (Linguistics)
Natural language processing (Computer science)
Self-adaptive software
Expert systems (Computer science)
Issue Date: 2002
Publisher: The ACM Digital Library
Citation: Ciravegna, F., Dingli, A., Wilks, Y., & Petrelli, D. (2002). Amilcare: adaptive information extraction for document annotation. 25th ACM/SIGIR International Conference on Research and Development in Information Retrieval, Tampere. 367-368.
Abstract: The traditional process of document annotation for knowledge identification and extraction in Knowledge Management (KM) is complex and time consuming, as it requires manual annotation by domain experts. In the typical scenario a domain expert:(1) builds an ontology describing the application domain: (2) annotates a number of texts in order to identify instances of elements (i.e., concepts and relations) in the texts. There is a strong interest in Text Mining technologies (and in particular in Human Language- based Technologies), for reducing the burden of text annotation for KM [1]. In this paper we show how Information Extraction from texts (IE) can provide support for document enrichment and make the text annotation process more effective and efficient. The main challenge to be addressed by IE researchers in this framework is portability of IE systems to new applications with no knowledge of HLT. As a matter of fact, in terms of IE each annotation task (e.g. tagging texts about failures in cars in order to identify faults and involved car parts) requires to port the IE system to a new application domain. IE is just one of the many technologies required by complex KM environments: wide use of IE tools will come only when the definition of such application domain will not require any specific IE skill apart from notions of KM. Moreover there is the need to port across different text types without major recoding of system resources, as documents in KM can range from free texts (technical reports, newspaper-like texts) to structured marked up texts (e.g. highly structured XML/HTML documents) and even a mixture of them [2].
URI: https://www.um.edu.mt/library/oar//handle/123456789/16950
ISSN: 01635840
Appears in Collections:Scholarly Works - FacICTAI

Files in This Item:
File Description SizeFormat 
Conference paper - Amilcare Adaptive information extraction for document annotation.pdf
  Restricted Access
Amilcare : adaptive information extraction for document annotation784.42 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.