MonStereo: When Monocular and Stereo Meet at the Tail of 3D Human Localization

Lorenzo Bertoni, Sven Kreiss, Taylor Mordan, Alexandre Alahi

Monocular and stereo vision are cost-effective solutions for 3D human localization in the context of self-driving cars or social robots. However, the two are usually developed independently, and each has its own strengths and limitations. We propose a novel unified learning framework that leverages the strengths of both monocular and stereo cues for 3D human localization. Our method jointly (i) ...