Title: Comparing title only and full text indexing to classify web pages into bookmark categories
Authors: Staff, Chris
Abela, Charlie
Keywords: Browsers (Computer programs)
Web browsing
File organization (Computer science)
Issue Date: 2007
Publisher: University of Malta. Faculty of ICT
Citation: Abela, C., & Staff, C. (2007). Comparing title only and full text indexing to classify web pages into bookmark categories. 5th Computer Science Annual Workshop (CSAW’07), Msida. 248-258.
Abstract: Web browser bookmark files are used to retain and organise records of web sites that the user would like to revisit. However, bookmark files tend to be under-utilised, as time and effort are needed to keep them organised. We use two methods to index and automatically classify documents referred to in 80 bookmark files, based on document title-only and full-text indexing, respectively. We evaluate the indexing methods by selecting a bookmark entry to classify from a bookmark file, and recreating the bookmark file so that it contains only entries created before the selected bookmark entry. Classification based on full-text indexing generally outperforms that based on title-only indexing. The ability to recommend the correct category at rank 1 using full-text indexing ranges from 20% to 41%, depending on the number of category members. Combining the approaches raises this to between 37% and 59%, but we would need to recommend up to two categories to users. By recommending up to 10 categories, this increases to between 58% and 80%.
Appears in Collections: Scholarly Works - FacICTAI
Scholarly Works - FacICTCS

Files in This Item:
File: Proceedings of CSAW’07 - A24.pdf
Size: 231.96 kB
Format: Adobe PDF

Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.