Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/22577
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAzzopardi, Joel-
dc.date.accessioned2017-10-13T15:44:48Z-
dc.date.available2017-10-13T15:44:48Z-
dc.date.issued2007-
dc.identifier.citationAzzopardi, J. (2007). Automatic clustering of news reports. 5th Computer Science Annual Workshop (CSAW’07), Msida. 11-23.en_GB
dc.identifier.urihttps://www.um.edu.mt/library/oar//handle/123456789/22577-
dc.description.abstractThe automatic clustering of news reports from various web-based news sites into clusters according to the event they cover serves not only to facilitate browsing of news reports by a users but may also serve as an initial stage in other complex systems such as Multi-Document Summarization systems or Document Fusion systems. In contrast to the usual scenarios of document clustering whereby the document collections are static or quasi-static, news sites are continuously updated with re- ports concerning new events. Here, we present a News Report Clustering system which is able to receive a stream of news reports which it clusters on the fly according to the event they cover. New clusters are automat- ically created as necessary for news reports which are covering ‘new’, previously unreported events. We compare the results of our system to the results produced by a standard K-Means clustering system, and we show that our system performs significantly better than the standard K- Means system even though the K-Means system was supplied with the correct number of clusters that should be produced. In fact, our clustering system obtained an average of 11.95% better recall, 28.68% better precision and 0.89% less fallout than the standard K-Means clustering system.en_GB
dc.language.isoenen_GB
dc.publisherUniversity of Malta. Faculty of ICTen_GB
dc.rightsinfo:eu-repo/semantics/openAccessen_GB
dc.subjectDocument clusteringen_GB
dc.subjectCluster analysis -- Data processingen_GB
dc.subjectCluster analysis -- Computer programsen_GB
dc.subjectNews Web sitesen_GB
dc.titleAutomatic clustering of news reportsen_GB
dc.typeconferenceObjecten_GB
dc.rights.holderThe copyright of this work belongs to the author(s)/publisher. The rights of this work are as defined by the appropriate Copyright Legislation or as modified by any successive legislation. Users may access this work and can make use of the information contained in accordance with the Copyright Legislation provided that the author must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the prior permission of the copyright holder.en_GB
dc.bibliographicCitation.conferencename5th Computer Science Annual Workshop (CSAW’07)en_GB
dc.bibliographicCitation.conferenceplaceMsida, Malta, 5-6/11/2007en_GB
dc.description.reviewedpeer-revieweden_GB
Appears in Collections:Scholarly Works - FacICTAI
Scholarly Works - FacICTCS

Files in This Item:
File Description SizeFormat 
Proceedings of CSAW’07 - A2.pdf296.91 kBAdobe PDFView/Open


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.