Improving super-resolution performance using meta-attention layers

Aquilina, Matthew; Galea, Christian; Abela, John; Camilleri, Kenneth P.; Farrugia, Reuben A.

Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/85808

Title:	Improving super-resolution performance using meta-attention layers
Authors:	Aquilina, Matthew Galea, Christian Abela, John Camilleri, Kenneth P. Farrugia, Reuben A.
Keywords:	Image processing Optical data processing Computer graphics Pattern recognition Image reconstruction
Issue Date:	2021
Publisher:	IEEE
Citation:	Aquilina, M., Galea, C., Abela, J., Camilleri, K. P., & Farrugia, R. A. (2021). Improving super-resolution performance using meta-attention layers. IEEE Signal Processing Letters, 28, 2082-2086.
Abstract:	Convolutional Neural Networks (CNNs) have achieved impressive results across many super-resolution (SR) and image restoration tasks. While many such networks can upscalelow-resolution (LR) images using just the raw pixel-level information, the ill-posed nature of SR can make it difficult to accurately super-resolve an image which has undergone multiple different degradations. Additional information (metadata) describing the degradation process (such as the blur kernel applied, compression level, etc.) can guide networks to super-resolve LR images with higher fidelity to the original source. Previous attempts at informing SR networks with degradation parameters have indeed been able to improve performance in a number of scenarios. However, due to the fully-convolutional nature of many SR networks, most of these metadata fusion methods either require a complete architectural change, or necessitate the addition of significant extra complexity. Thus, these approaches are difficult to introduce into arbitrary SR networks without considerable design alterations. In this letter, we introduce meta-attention, a simple mechanism which allows any SR CNN to exploit the information available in relevant degradation parameters. The mechanism functions by translating the metadata into a channel attention vector, which in turn selectively modulates the network's feature maps. Incorporating meta-attention into SR networks is straightforward, as it requires no specific type of architecture to function correctly. Extensive testing has shown that meta-attention can consistently improve the pixel-level accuracy of state-of-the-art (SOTA) networks when provided with relevant degradation metadata. Despite average memory/runtime overheads of less than $\approx$ 2.6%/0.025 seconds for the datasets and models considered, meta-attention improves the performance for both PSNR and SSIM; for PSNR, the gain onblurred/downsampled (×4) images is of 0.2969 dB (on average) and 0.3320 dB for SOTA general and face SR models, respectively. The coding framework used for this letter is available at: https://github.com/um-dsrg/Super-Resolution-Meta-Attention-Networks .
URI:	https://www.um.edu.mt/library/oar/handle/123456789/85808
Appears in Collections:	Scholarly Works - FacICTCCE

Files in This Item:

File	Description	Size	Format
Improving_Super-Resolution_Performance_Using_Meta-Attention_Layers.pdf Restricted Access		1.32 MB	Adobe PDF	View/Open Request a copy

Show full item record Statistics