Michigan Tech Publications

Fractal, recurrent, and dense U-Net architectures with EfficientNet encoder for medical image segmentation

Nahian Siddique, Purdue University Northwest
Sidike Paheding, Michigan Technological University
Abel A. Reyes Angulo, Purdue University Northwest
Md Zahangir Alom, St. Jude Children's Research Hospital
Vijay K. Devabhaktuni, University of Maine

Document Type

Article

Publication Date

12-2022

Department

Department of Applied Computing

Abstract

PURPOSE: U-Net is a deep learning technique that has made significant contributions to medical image segmentation. Although the accomplishments of deep learning algorithms in image processing are evident, many challenges still need to be overcome to achieve human-like performance. One of the main challenges in building deeper U-Nets lies in black-box problems, such as vanishing gradients. Overcoming this problem allows us to develop neural networks with advanced network designs.

APPROACH: We propose three U-Net variants, namely efficient R2U-Net, efficient dense U-Net, and efficient fractal U-Net, that can create highly accurate segmentation maps. The first part of our contribution makes use of EfficientNet to distribute resources in the network efficiently. The second part of our work applies the following layer connections to design the U-Net decoders: residual connections, dense connections, and fractal expansion. We apply EfficientNet as the encoder to our three decoders to design three conceivable models.

RESULTS: The three proposed deep learning models were tested on four benchmark datasets: the CHASE DB1 and digital retinal images for vessel extraction (DRIVE) retinal image databases and the ISIC 2018 and HAM10000 dermoscopy image databases. We obtained the highest Dice coefficients of 0.8013, 0.8808, 0.8019, and 0.9295 for CHASE DB1, ISIC 2018, DRIVE, and HAM10000, respectively, and Jaccard (JAC) scores of 0.6686, 0.7870, 0.6694, and 0.8683 for CHASE DB1, ISIC 2018, DRIVE, and HAM10000, respectively. Statistical analysis revealed that the proposed deep learning models achieved better segmentation results compared with the state-of-the-art models.

CONCLUSIONS: U-Net is quite an adaptable deep learning framework and can be integrated with other deep learning techniques. The use of recurrent feedback connections, dense convolution, residual skip connections, and fractal convolutional expansions allows for the design of improved deeper U-Net models. With the addition of EfficientNet, we can now leverage the performance of an optimally scaled classifier for U-Net encoders.
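The Dice and Jaccard scores reported in the abstract are both overlap measures between a predicted mask and a ground-truth mask. The following is a minimal sketch of how the two are commonly computed, not the authors' evaluation code; the function names `dice_and_jaccard` and `jaccard_from_dice`, the flattened 0/1 list representation, and the convention of scoring two empty masks as 1.0 are all assumptions made for illustration.

```python
def dice_and_jaccard(pred, target):
    """Return (dice, jaccard) for two equal-length binary masks.

    Masks are flattened sequences of 0/1 values (an assumed representation).
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels marked foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_sum = sum(1 for p in pred if p)
    target_sum = sum(1 for t in target if t)
    union = pred_sum + target_sum - intersection

    if union == 0:
        # Both masks are empty; treating this as perfect agreement is a
        # convention chosen here, not something stated in the abstract.
        return 1.0, 1.0

    dice = 2.0 * intersection / (pred_sum + target_sum)
    jaccard = intersection / union
    return dice, jaccard


def jaccard_from_dice(dice):
    """For a single pair of masks, Jaccard = Dice / (2 - Dice)."""
    return dice / (2.0 - dice)


# Toy 8-pixel example (hypothetical data).
pred = [1, 1, 1, 0, 0, 0, 1, 0]
target = [1, 1, 0, 0, 0, 1, 1, 0]
dice, jaccard = dice_and_jaccard(pred, target)
print(dice, jaccard)            # 0.75 0.6
print(jaccard_from_dice(dice))  # 0.6
```

For a single pair of masks the two scores are tied by J = D / (2 - D). Scores averaged over a whole dataset need not satisfy this identity exactly, although the reported pairs are close to it (for example, a Dice of 0.8808 corresponds to a Jaccard of roughly 0.787).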

Publication Title

Journal of medical imaging (Bellingham, Wash.)

Recommended Citation

Siddique, N., Paheding, S., Reyes Angulo, A. A., Alom, M. Z., & Devabhaktuni, V. K. (2022). Fractal, recurrent, and dense U-Net architectures with EfficientNet encoder for medical image segmentation. Journal of medical imaging (Bellingham, Wash.), 9(6), 064004. http://doi.org/10.1117/1.JMI.9.6.064004
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/16777
