Joint spatial-depth feature pooling for RGB-D object classification

Hong Pan, Søren Ingvor Olsen, Yaping Zhu

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

6 Downloads (Pure)

Abstract

RGB-D camera can provide effective support with additional depth cue for many RGB-D perception tasks beyond traditional RGB information. However, current feature representations based on RGB-D camera utilize depth information only to extract local features, without considering it for the improvement of robustness and discriminability of the feature representation by merging depth cues into feature pooling. Spatial pyramid model (SPM) has become the standard protocol to split 2D image plane into sub-regions for feature pooling in RGB-D object classification. We argue that SPM may not be the optimal pooling scheme for RGB-D images, as it only pools features spatially and completely discards the depth topological information. Instead, we propose a novel joint spatial-depth pooling scheme (JSDP) which further partitions SPM using the depth cue and pools features simultaneous in 2D image plane and the depth direction. Embedding the JSDP with the standard feature extraction and feature encoding modules, we achieve superior performance to the state-of-the-art methods on benchmarks for RGB-D object classification and detection.
OriginalsprogEngelsk
TitelImage analysis : 19th Scandinavian Conference, SCIA 2015, Copenhagen, Denmark, June 15-17, 2015. Proceedings
RedaktørerRasmus R. Paulsen, Kim S. Pedersen
Antal sider13
ForlagSpringer
Publikationsdato2015
Sider314-326
ISBN (Trykt)978-3-319-19664-0
ISBN (Elektronisk)978-3-319-19665-7
DOI
StatusUdgivet - 2015
BegivenhedScandinavian Conference, SCIA 2015 - Copenhagen, Danmark
Varighed: 15 jun. 201517 jun. 2015
Konferencens nummer: 19

Konference

KonferenceScandinavian Conference, SCIA 2015
Nummer19
Land/OmrådeDanmark
ByCopenhagen
Periode15/06/201517/06/2015
NavnLecture notes in computer science
Vol/bind9127
ISSN0302-9743

Bibliografisk note

Achieved the Best Paper award of the conference

Citationsformater