Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/103262
Title: | Fusion of news reports using surface-based methods |
Authors: | Azzopardi, Joel Staff, Christopher |
Keywords: | Document clustering -- Methodology News Web sites Cluster analysis -- Data processing Conceptual structures (Information theory) |
Issue Date: | 2012 |
Publisher: | Institute of Electrical and Electronics Engineers |
Citation: | Azzopardi, J., & Staff, C. (2012). Fusion of News Reports Using Surface-Based Methods. In Proceedings of the 2012 26th International Conference on Advanced Information Networking and Applications Workshops, Fukuoka, Japan. 809-814. |
Abstract: | Events occurring in the real world are covered by news reports from different sources. Each report generally contains information that is found in others, but may also contain unique information. To learn all the information about a particular event, a user will need to read all the different reports. This is a duplication of effort since most information will be repeated in the different reports. In our research, we attempt to fuse news reports about the same event into a single coherent document eliminating repetition but preserving all the information contained in the source reports using only surface-based methods. Information in each news report is represented by a set of entity relationship graphs. The graphs representing each report are then merged into a single graph whilst keeping track of the source sentences. The fused report is generated using the maximally expressive set of sentences – the sentences that carry most information about the entities and their relationships in the news report, and ensuring that all entities and relationships are expressed in the fused document. Our Document fusion system was evaluated using a set of news reports downloaded from MSNBC News that cite their sources, and also using human evaluation. We show that our system is able to capture most of the information found across different source documents whilst maintaining readability. |
URI: | https://www.um.edu.mt/library/oar/handle/123456789/103262 |
Appears in Collections: | Scholarly Works - FacICTAI |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Fusion of news reports using surface based methods 2012.pdf Restricted Access | 150.02 kB | Adobe PDF | View/Open Request a copy |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.