0

RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects

A multi-modal conditional denoising diffusion model generates high-resolution depth maps from low-resolution RGB images using novel augmentation techniques to enhance robustness.

Year
2023
Venue
arXiv 2023
Authors
6
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2307.15988ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

We present RGB-D-Fusion, a multi-modal conditional denoising diffusion probabilistic model to generate high resolution depth maps from low-resolution monocular RGB images of humanoid subjects. RGB-D-Fusion first generates a low-resolution depth map using an image conditioned denoising diffusion probabilistic model and then upsamples the depth map using a second denoising diffusion probabilistic model conditioned on a low-resolution RGB-D image. We further introduce a novel augmentation technique, depth noise augmentation, to increase the robustness of our super-resolution model.

Authors

6