Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Robust fusion for skin lesion segmentation of dermoscopic images

Guo, Qingqing ; Fang, Xianyong ; Wang, Linbo ; Zhang, Enming LU and Liu, Zhengyi (2023) In Frontiers in Bioengineering and Biotechnology 11.
Abstract

Robust skin lesion segmentation of dermoscopic images is still very difficult. Recent methods often take the combinations of CNN and Transformer for feature abstraction and multi-scale features for further classification. Both types of combination in general rely on some forms of feature fusion. This paper considers these fusions from two novel points of view. For abstraction, Transformer is viewed as the affinity exploration of different patch tokens and can be applied to attend CNN features in multiple scales. Consequently, a new fusion module, the Attention-based Transformer-And-CNN fusion module (ATAC), is proposed. ATAC augments the CNN features with more global contexts. For further classification, adaptively combining the... (More)

Robust skin lesion segmentation of dermoscopic images is still very difficult. Recent methods often take the combinations of CNN and Transformer for feature abstraction and multi-scale features for further classification. Both types of combination in general rely on some forms of feature fusion. This paper considers these fusions from two novel points of view. For abstraction, Transformer is viewed as the affinity exploration of different patch tokens and can be applied to attend CNN features in multiple scales. Consequently, a new fusion module, the Attention-based Transformer-And-CNN fusion module (ATAC), is proposed. ATAC augments the CNN features with more global contexts. For further classification, adaptively combining the information from multiple scales according to their contributions to object recognition is expected. Accordingly, a new fusion module, the GAting-based Multi-Scale fusion module (GAMS), is also introduced, which adaptively weights the information from multiple scales by the light-weighted gating mechanism. Combining ATAC and GAMS leads to a new encoder-decoder-based framework. In this method, ATAC acts as an encoder block to progressively abstract strong CNN features with rich global contexts attended by long-range relations, while GAMS works as an enhancement of the decoder to generate the discriminative features through adaptive fusion of multi-scale ones. This framework is especially good at lesions of varying sizes and shapes and of low contrasts and its performances are demonstrated with extensive experiments on public skin lesion segmentation datasets.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
deep learning, gating mechanism, robust fusion, skin lesion segmentation, transformer
in
Frontiers in Bioengineering and Biotechnology
volume
11
article number
1057866
publisher
Frontiers Media S. A.
external identifiers
  • scopus:85151511295
  • pmid:37020509
ISSN
2296-4185
DOI
10.3389/fbioe.2023.1057866
language
English
LU publication?
yes
id
2637d69e-f29c-4045-a1a7-87a09c5f18b0
date added to LUP
2023-05-23 15:06:07
date last changed
2024-04-19 22:09:27
@article{2637d69e-f29c-4045-a1a7-87a09c5f18b0,
  abstract     = {{<p>Robust skin lesion segmentation of dermoscopic images is still very difficult. Recent methods often take the combinations of CNN and Transformer for feature abstraction and multi-scale features for further classification. Both types of combination in general rely on some forms of feature fusion. This paper considers these fusions from two novel points of view. For abstraction, Transformer is viewed as the affinity exploration of different patch tokens and can be applied to attend CNN features in multiple scales. Consequently, a new fusion module, the Attention-based Transformer-And-CNN fusion module (ATAC), is proposed. ATAC augments the CNN features with more global contexts. For further classification, adaptively combining the information from multiple scales according to their contributions to object recognition is expected. Accordingly, a new fusion module, the GAting-based Multi-Scale fusion module (GAMS), is also introduced, which adaptively weights the information from multiple scales by the light-weighted gating mechanism. Combining ATAC and GAMS leads to a new encoder-decoder-based framework. In this method, ATAC acts as an encoder block to progressively abstract strong CNN features with rich global contexts attended by long-range relations, while GAMS works as an enhancement of the decoder to generate the discriminative features through adaptive fusion of multi-scale ones. This framework is especially good at lesions of varying sizes and shapes and of low contrasts and its performances are demonstrated with extensive experiments on public skin lesion segmentation datasets.</p>}},
  author       = {{Guo, Qingqing and Fang, Xianyong and Wang, Linbo and Zhang, Enming and Liu, Zhengyi}},
  issn         = {{2296-4185}},
  keywords     = {{deep learning; gating mechanism; robust fusion; skin lesion segmentation; transformer}},
  language     = {{eng}},
  publisher    = {{Frontiers Media S. A.}},
  series       = {{Frontiers in Bioengineering and Biotechnology}},
  title        = {{Robust fusion for skin lesion segmentation of dermoscopic images}},
  url          = {{http://dx.doi.org/10.3389/fbioe.2023.1057866}},
  doi          = {{10.3389/fbioe.2023.1057866}},
  volume       = {{11}},
  year         = {{2023}},
}