This work proposes a novel approach to 4D radar-based scene flow estimation via cross-modal learning. Our approach is motivated by the co-located sensing redundancy in modern autonomous vehicles. Such redundancy implicitly provides various forms of supervision cues to the radar scene flow estimation. Specifically, we introduce a multi-task model architecture for the identified cross-modal learning problem and propose loss functions to opportunistically engage scene flow estimation using multiple cross-modal constraints for effective model training. Extensive experiments show the state-of-the-art performance of our method and demonstrate the effectiveness of cross-modal supervised learning to infer more accurate 4D radar scene flow. We also show its usefulness to two subtasks - motion segmentation and ego-motion estimation. Our source code will be available on https://github.com/Toytiny/CMFlow.
Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision
A multi-task model architecture uses cross-modal learning to improve 4D radar scene flow estimation, enhancing accuracy and applicability to motion segmentation and ego-motion estimation.
- Year
- 2023
- Venue
- CVPR 2023 1
- Authors
- 4
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2303.00462v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar