Im2Contact: Vision-Based Contact Localization Without Touch or Force Sensing

Leon Kim, Yunshuang Li, Michael Posa, and Dinesh Jayaraman

In Conference on Robot Learning (CoRL), 2023

Contacts play a critical role in most manipulation tasks. Robots today mainly use proximal touch/force sensors to sense contacts, but the information they provide must be calibrated and is inherently local, with practical applications relying either on extensive surface coverage or restrictive assumptions to resolve ambiguities. We propose a vision-based extrinsic contact localization task: with only a single RGB-D camera view of a robot workspace, identify when and where an object held by the robot contacts the rest of the environment. We show that careful task-attuned design is critical for a neural network trained in simulation to discover solutions that transfer well to a real robot. Our final approach \methodname demonstrates the promise of versatile general-purpose contact perception from vision alone, performing well for localizing various contact types (point, line, or planar; sticking, sliding, or rolling; single or multiple), and even under occlusions in its camera view.

PDF
Publisher Website
Project Website
@inproceedings{Kim2023,
  title = {Im2Contact: Vision-Based Contact Localization Without Touch or Force Sensing},
  author = {Kim, Leon and Li, Yunshuang and Posa, Michael and Jayaraman, Dinesh},
  year = {2023},
  month = nov,
  booktitle = {Conference on Robot Learning (CoRL)},
  url = {https://openreview.net/forum?id=h8halpbqB-},
  website = {https://sites.google.com/view/im2contact/home}
}