VLG | Computer Vision and Learning Group

Authors:Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari

Abstract

We propose a method to detect and reconstruct multiple 3D objects from a single RGB image. The key idea is to optimize for detection, alignment and shape jointly over all objects in the RGB image, while focusing on realistic and physically plausible reconstructions. To this end, we propose a keypoint detector that localizes objects as center points and directly predicts all object properties, including 9-DoF bounding boxes and 3D shapes -- all in a single forward pass.

Authors:

Dr. Francis Engelmann
PostDoc at Stanford University

Links:

Project PDF Source BibTeX

Points2Objects: From Points to Multi-Object 3D Reconstruction

Conference: Computer Vision and Pattern Recognition (CVPR), 2021

Abstract

Authors:

Links: