• 1. School of Information Science and Engineering, Yunnan University, Kunming 650504, P. R. China;
  • 2. The Second People's Hospital of Honghe Prefecture, Jianshui, Yunnan 654300, P. R. China;
ZHOU Dongming, Email: zhoudm@ynu.edu.cn
Export PDF Favorites Scan Get Citation

In response to the issues of single-scale information loss and large model parameter size during the sampling process in U-Net and its variants for medical image segmentation, this paper proposes a multi-scale medical image segmentation method based on pixel encoding and spatial attention. Firstly, by redesigning the input strategy of the Transformer structure, a pixel encoding module is introduced to enable the model to extract global semantic information from multi-scale image features, obtaining richer feature information. Additionally, deformable convolutions are incorporated into the Transformer module to accelerate convergence speed and improve module performance. Secondly, a spatial attention module with residual connections is introduced to allow the model to focus on the foreground information of the fused feature maps. Finally, through ablation experiments, the network is lightweighted to enhance segmentation accuracy and accelerate model convergence. The proposed algorithm achieves satisfactory results on the Synapse dataset, an official public dataset for multi-organ segmentation provided by the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), with Dice similarity coefficient (DSC) and 95% Hausdorff distance (HD95) scores of 77.65 and 18.34, respectively. The experimental results demonstrate that the proposed algorithm can enhance multi-organ segmentation performance, potentially filling the gap in multi-scale medical image segmentation algorithms, and providing assistance for professional physicians in diagnosis.

Citation: WAN Yulong, ZHOU Dongming, WANG Changcheng, LIU Yisong, BAI Chongbin. Multi-scale medical image segmentation based on pixel encoding and spatial attention mechanism. Journal of Biomedical Engineering, 2024, 41(3): 511-519. doi: 10.7507/1001-5515.202310001 Copy

  • Previous Article

    A design of interactive review for computer aided diagnosis of pulmonary nodules based on active learning
  • Next Article

    Research on intelligent tooth segmentation method combining multiple seed region growth and boundary extension