Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/135964
Title: Correlation of object detection performance with visual saliency and depth estimation
Authors: Bartolo, Matthias
Seychell, Dylan
Keywords: Computer vision
Image processing -- Methodology
Reinforcement learning
Deep learning (Machine learning)
Issue Date: 2024
Publisher: arXiv
Citation: Bartolo, M., & Seychell, D. (2024). Correlation of Object Detection Performance with Visual Saliency and Depth Estimation. arXiv preprint arXiv:2411.02844.
Abstract: As object detection techniques continue to evolve, understanding their relationships with complementary visual tasks becomes crucial for optimising model architectures and computational resources. This paper investigates the correla- tions between object detection accuracy and two fundamental visual tasks: depth prediction and visual saliency prediction. Through comprehensive experiments using state-of-the-art mod- els (DeepGaze IIE, Depth Anything, DPT-Large, and Itti’s model) on COCO and Pascal VOC datasets, we find that visual saliency shows consistently stronger correlations with object detection accuracy (mAρ up to 0.459 on Pascal VOC) compared to depth prediction (mAρ up to 0.283). Our analysis reveals significant variations in these correlations across object categories, with larger objects showing correlation values up to three times higher than smaller objects. These findings suggest incorporating visual saliency features into object detection architectures could be more beneficial than depth information, particularly for specific object categories. The observed category-specific variations also provide insights for targeted feature engineering and dataset design improvements, potentially leading to more efficient and accurate object detection systems.
URI: https://www.um.edu.mt/library/oar/handle/123456789/135964
Appears in Collections:Scholarly Works - FacICTAI

Files in This Item:
File Description SizeFormat 
Correlation of object detection performance with visual saliency and depth estimation.pdf7.43 MBAdobe PDFView/Open


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.