Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/85808
Title: Improving super-resolution performance using meta-attention layers
Authors: Aquilina, Matthew
Galea, Christian
Abela, John
Camilleri, Kenneth P.
Farrugia, Reuben A.
Keywords: Image processing
Optical data processing
Computer graphics
Pattern recognition
Image reconstruction
Issue Date: 2021
Publisher: IEEE
Citation: Aquilina, M., Galea, C., Abela, J., Camilleri, K. P., & Farrugia, R. A. (2021). Improving super-resolution performance using meta-attention layers. IEEE Signal Processing Letters, 28, 2082-2086.
Abstract: Convolutional Neural Networks (CNNs) have achieved impressive results across many super-resolution (SR) and image restoration tasks. While many such networks can upscalelow-resolution (LR) images using just the raw pixel-level information, the ill-posed nature of SR can make it difficult to accurately super-resolve an image which has undergone multiple different degradations. Additional information (metadata) describing the degradation process (such as the blur kernel applied, compression level, etc.) can guide networks to super-resolve LR images with higher fidelity to the original source. Previous attempts at informing SR networks with degradation parameters have indeed been able to improve performance in a number of scenarios. However, due to the fully-convolutional nature of many SR networks, most of these metadata fusion methods either require a complete architectural change, or necessitate the addition of significant extra complexity. Thus, these approaches are difficult to introduce into arbitrary SR networks without considerable design alterations. In this letter, we introduce meta-attention, a simple mechanism which allows any SR CNN to exploit the information available in relevant degradation parameters. The mechanism functions by translating the metadata into a channel attention vector, which in turn selectively modulates the network's feature maps. Incorporating meta-attention into SR networks is straightforward, as it requires no specific type of architecture to function correctly. Extensive testing has shown that meta-attention can consistently improve the pixel-level accuracy of state-of-the-art (SOTA) networks when provided with relevant degradation metadata. Despite average memory/runtime overheads of less than $\approx$ 2.6%/0.025 seconds for the datasets and models considered, meta-attention improves the performance for both PSNR and SSIM; for PSNR, the gain onblurred/downsampled (×4) images is of 0.2969 dB (on average) and 0.3320 dB for SOTA general and face SR models, respectively. The coding framework used for this letter is available at: https://github.com/um-dsrg/Super-Resolution-Meta-Attention-Networks .
URI: https://www.um.edu.mt/library/oar/handle/123456789/85808
Appears in Collections:Scholarly Works - FacICTCCE

Files in This Item:
File Description SizeFormat 
Improving_Super-Resolution_Performance_Using_Meta-Attention_Layers.pdf
  Restricted Access
1.32 MBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.