Please use this identifier to cite or link to this item:
Title: Automatic semantic annotation using unsupervised information extraction and integration
Authors: Dingli, Alexiei
Ciravegna, Fabio
Wilks, Yorick
Keywords: Semantic Web
Text processing (Computer science)
Machine learning
Semantic integration (Computer systems)
Digital libraries
Issue Date: 2003
Citation: Dingli, A., Ciravegna, F., & Wilks, Y. (2003). Automatic semantic annotation using unsupervised information extraction and integration. K-CAP 2003 Workshop of Knowledge Markup and Semantic Annotation, Sanibel. 1-8.
Abstract: In this paper we propose a methodology to learn to automatically annotate domain-specific information from large repositories (e.g. Web sites) with minimum user intervention. The methodology is based on a combination of information extraction, information integration and machine learning techniques. Learning is seeded by extracting information from structured sources (e.g. databases and digital libraries). Retrieved information is then used to partially annotate documents. These annotated documents are used to bootstrap learning for simple Information Extraction (IE) methodologies, which in turn will produce more annotations used to annotate more documents. It will be used to train more complex IE engines and the cycle will keep on repeating itself until the required information is obtained. The user intervention is limited to providing an initial URL and to correct information if it is the case when the computation is finished. The revised annotation can then be reused to provide further training and therefore getting more information and/or more precision.
Appears in Collections:Scholarly Works - FacICTAI

Files in This Item:
File Description SizeFormat 
OA - Automatic Semantic Annotation using Unsupervised Information Extraction and Integration.2-9.pdfAutomatic semantic annotation using unsupervised information extraction and integration204.09 kBAdobe PDFView/Open

Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.