Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

Shiyang Lu, Yunfu Deng, Abdeslam Boularias, Kostas Bekris

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This work proposes a self-supervised learning system for segmenting rigid objects in RGB images. The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot. A key feature of the self-supervised training process is a graph-matching algorithm that operates on the over-segmentation output of the point cloud that is reconstructed from each video. The graph matching, along with point cloud registration, is able to find reoccurring object patterns across videos and combine them into 3D object pseudo labels, even under occlusions or different viewing angles. Projected 2D object masks from 3D pseudo labels are used to train a pixel-wise feature extractor through contrastive learning. During online inference, a clustering method uses the learned features to cluster foreground pixels into object segments. Experiments highlight the method's effectiveness on both real and synthetic video datasets, which include cluttered scenes of tabletop objects. The proposed method outperforms existing unsupervised methods for object segmentation by a large margin.

Original languageEnglish (US)
Title of host publicationProceedings - ICRA 2023
Subtitle of host publicationIEEE International Conference on Robotics and Automation
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages7017-7023
Number of pages7
ISBN (Electronic)9798350323658
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Robotics and Automation, ICRA 2023 - London, United Kingdom
Duration: May 29 2023Jun 2 2023

Publication series

NameProceedings - IEEE International Conference on Robotics and Automation
Volume2023-May

Conference

Conference2023 IEEE International Conference on Robotics and Automation, ICRA 2023
Country/TerritoryUnited Kingdom
CityLondon
Period5/29/236/2/23

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos'. Together they form a unique fingerprint.

Cite this