Gabor Filter-Embedded U-Net with Transformer-Based Encoding for Biomedical Image Segmentation
Document Type
Conference Proceeding
Publication Date
10-12-2022
Department
Department of Applied Computing
Abstract
Medical image segmentation involves a process of categorization of target regions that are typically varied in terms of shape, orientation and scales. This requires highly accurate algorithms as marginal segmentation errors in medical images may lead to inaccurate diagnosis in subsequent procedures. The U-Net framework has become one of the dominant deep neural network architectures for medical image segmentation. Due to complex and irregular shape of objects involved in medical images, robust feature representations that correspond to various spatial transformations are key to achieve successful results. Although U-Net-based deep architectures can perform feature extraction and localization, the design of specialized architectures or layer modifications is often an intricate task. In this paper, we propose an effective solution to this problem by introducing Gabor filter banks into the U-Net encoder, which has not yet been well explored in existing U-Net-based segmentation frameworks. In addition, global self-attention mechanisms and Transformer layers are also incorporated into the U-Net framework to capture global contexts. Through extensive testing on two benchmark datasets, we show that the Gabor filter-embedded U-Net with Transformer encoders can enhance the robustness of deep-learned features, and thus achieve a more competitive performance.
Publication Title
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN
9783031188138
Recommended Citation
Reyes, A. A.,
Paheding, S.,
Deo, M.,
&
Audette, M.
(2022).
Gabor Filter-Embedded U-Net with Transformer-Based Encoding for Biomedical Image Segmentation.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),
13594 LNCS, 76-88.
http://doi.org/10.1007/978-3-031-18814-5_8
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/16610
Publisher's Statement
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG. Publisher’s version of record: https://doi.org/10.1007/978-3-031-18814-5_8