SGDC: 구조 기반 동적 컨볼루션을 이용한 의료 영상 분할

초록

공간적으로 변동하는 동적 합성곱은 심층 신경망에 공간 적응성을 통합하는 원칙적인 접근법을 제공합니다. 그러나 의료 영상 분할 분야의 주류 설계는 일반적으로 평균 풀링을 통해 동적 커널을 생성하는데, 이는 암묵적으로 고주파 공간 세부 정보를 거친 공간 압축 표현으로 축소하여 과도하게 평활화된 예측을 초래하며, 세밀한 임상 구조의 정확도를 저하시킵니다. 이러한 한계를 해결하기 위해 본 연구에서는 명시적으로 지도되는 구조 추출 브랜치를 활용하여 구조 인식 특징 변조를 위한 동적 커널 및 게이팅 신호 생성을 안내하는 새로운 구조 기반 동적 합성곱(SGDC) 메커니즘을 제안합니다. 구체적으로, 이 보조 브랜치에서 얻은 높은 정확도의 경계 정보가 의미론적 특징과 융합되어 공간적으로 정밀한 특징 변조를 가능하게 합니다. 컨텍스트 집계 대신 픽셀 단위 구조 안내를 사용함으로써, 제안된 설계는 평균 풀링으로 인한 정보 손실을 효과적으로 방지합니다. 실험 결과, SGDC는 ISIC 2016, PH2, ISIC 2018 및 CoNIC 데이터셋에서 최첨단 성능을 달성하여 Hausdorff 거리(HD95)를 2.05 감소시키는 우수한 경계 정확도를 제공하고, 풀링 기반 기준 모델 대비 0.99%~1.49%의 지속적인 IoU 향상을 제공합니다. 더욱이, 이 메커니즘은 소형 객체 감지와 같은 다른 세밀하고 구조에 민감한 비전 작업으로의 확장에 대한 강력한 잠재력을 보여주며, 의료 영상 분석에서 구조적 무결성을 보존하기 위한 원칙적인 해결책을 제시합니다. 재현성을 용이하게 하고 추가 연구를 장려하기 위해 SGE 및 SGDC 모듈의 구현 코드는 https://github.com/solstice0621/SGDC에서 공개되었습니다.

English

Spatially variant dynamic convolution provides a principled approach of integrating spatial adaptivity into deep neural networks. However, mainstream designs in medical segmentation commonly generate dynamic kernels through average pooling, which implicitly collapses high-frequency spatial details into a coarse, spatially-compressed representation, leading to over-smoothed predictions that degrade the fidelity of fine-grained clinical structures. To address this limitation, we propose a novel Structure-Guided Dynamic Convolution (SGDC) mechanism, which leverages an explicitly supervised structure-extraction branch to guide the generation of dynamic kernels and gating signals for structure-aware feature modulation. Specifically, the high-fidelity boundary information from this auxiliary branch is fused with semantic features to enable spatially-precise feature modulation. By replacing context aggregation with pixel-wise structural guidance, the proposed design effectively prevents the information loss introduced by average pooling. Experimental results show that SGDC achieves state-of-the-art performance on ISIC 2016, PH2, ISIC 2018, and CoNIC datasets, delivering superior boundary fidelity by reducing the Hausdorff Distance (HD95) by 2.05, and providing consistent IoU gains of 0.99\%-1.49\% over pooling-based baselines. Moreover, the mechanism exhibits strong potential for extension to other fine-grained, structure-sensitive vision tasks, such as small-object detection, offering a principled solution for preserving structural integrity in medical image analysis. To facilitate reproducibility and encourage further research, the implementation code for both our SGE and SGDC modules has been is publicly released at https://github.com/solstice0621/SGDC.

SGDC: 구조 기반 동적 컨볼루션을 이용한 의료 영상 분할

SGDC: Structurally-Guided Dynamic Convolution for Medical Image Segmentation

초록

Support