Gated Path Selection Network for Semantic Segmentation

Qichuan Geng, Hong Zhang, Xiaojuan Qi, Gao Huang, Ruigang Yang, Zhong Zhou

Research output: Contribution to journalArticlepeer-review

26 Scopus citations

Abstract

Semantic segmentation is a challenging task that needs to handle large scale variations, deformations, and different viewpoints. In this paper, we develop a novel network named Gated Path Selection Network (GPSNet), which aims to adaptively select receptive fields while maintaining the dense sampling capability. In GPSNet, we first design a two-dimensional SuperNet, which densely incorporates features from growing receptive fields. And then, a Comparative Feature Aggregation (CFA) module is introduced to dynamically aggregate discriminative semantic context. In contrast to previous works that focus on optimizing sparse sampling locations on regular grids, GPSNet can adaptively harvest free form dense semantic context information. The derived adaptive receptive fields and dense sampling locations are data-dependent and flexible which can model various contexts of objects. On two representative semantic segmentation datasets, i.e., Cityscapes and ADE20K, we show that the proposed approach consistently outperforms previous methods without bells and whistles.

Original languageEnglish
Article number9318517
Pages (from-to)2436-2449
Number of pages14
JournalIEEE Transactions on Image Processing
Volume30
DOIs
StatePublished - 2021

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Funding

Manuscript received April 2, 2020; revised October 10, 2020; accepted December 5, 2020. Date of publication January 8, 2021; date of current version February 1, 2021. This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB2100601 and in part by the National Natural Science Foundation of China (NSFC) under Grant 61872024. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Raja Bala. (Corresponding author: Zhong Zhou.) Qichuan Geng is with the State Key Laboratory of Virtual Reality Technology and Systems, School of Computer Science and Engineering, Beihang University, Beijing 100191, China. His work was partially done when he was an intern at Baidu Research, Beijing 100193, China (e-mail: zhaokefirst@ buaa.edu.cn).

FundersFunder number
National Natural Science Foundation of China (NSFC)61872024
National Key Research and Development Program of China2018YFB2100601

    Keywords

    • Semantic segmentation
    • adaptive context aggregation
    • adaptive receptive fields and sampling locations
    • local discriminative feature

    ASJC Scopus subject areas

    • Software
    • Computer Graphics and Computer-Aided Design

    Fingerprint

    Dive into the research topics of 'Gated Path Selection Network for Semantic Segmentation'. Together they form a unique fingerprint.

    Cite this