Intra-inter Modal Attention Blocks for RGB-D Semantic Segmentation

  • Choi, Soyun
  • Zhang, Youjia
  • Hong, Sungeun
Citations

WEB OF SCIENCE

5
Citations

SCOPUS

6

초록

In this paper, we introduce a novel approach to address the challenge of effectively utilizing both RGB and depth information for semantic segmentation. Our approach, Intra-inter Modal Attention (IMA) blocks, considers both intra-modal and inter-modal aspects of the information to produce better results than prior methods which primarily focused on inter-modal relationships. The IMA blocks consist of a cross-modal non-local module and an adaptive channel-wise fusion module. The cross-modal non-local module captures both intra-modal and inter-modal variations at the spatial level through inter-modality parameter sharing, while the adaptive channel-wise fusion module refines the spatially-correlated features. Experimental results on RGB-D benchmark datasets demonstrate consistent performance improvements over various baseline segmentation networks when using the IMA blocks. Our in-depth analysis provides comprehensive results on the impact of intra-, inter-, and intra-inter modal attention on RGB-D segmentation.

키워드

RGB-D Semantic SegmentationMulti-modal FusionNon-local AttentionAGGREGATION
제목
Intra-inter Modal Attention Blocks for RGB-D Semantic Segmentation
저자
Choi, SoyunZhang, YoujiaHong, Sungeun
DOI
10.1145/3591106.3592235
발행일
2023
유형
Proceedings Paper
저널명
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023
페이지
217 ~ 225