We introduce a learning-based depth map fusion framework that accepts a set of depth and confidence maps generated by a Multi-View Stereo (MVS) algorithm as input and improves them. This is accomplished by integrating volumetric visibility constraints that encode long-range surface relationships across different views into an end-to-end trainable architecture. We also introduce a depth search window estimation sub-network trained jointly with the larger fusion sub-network to reduce the depth hypothesis search space along each ray. Our method learns to model depth consensus and violations of visibility constraints directly from the data; effectively removing the necessity of fine-tuning fusion parameters. Extensive experiments on MVS datasets show substantial improvements in the accuracy of the output fused depth and confidence maps.
V-FUSE: Volumetric Depth Map Fusion with Long-Range Constraints
A learning-based framework fuses depth and confidence maps using volumetric visibility constraints and depth search window estimation, improving accuracy on MVS datasets without the need for fine-tuning.
- Year
- 2023
- Venue
- ICCV 2023 1
- Authors
- 2
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2308.08715ARXIV-DEFAULT
- TL;DR
- Semantic Scholar