Delin Qu
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
arXiv 2025
Hume: Introducing System-2 Thinking in Visual-Language-Action Model
arXiv 2025
Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models
arXiv 2025
Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge
arXiv 2025
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
arXiv 2025
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
arXiv 2025
Exploring the Potential of Encoder-free Architectures in 3D LMMs
arXiv 2025
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
arXiv 2025
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control
arXiv 2024
Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction
ICCV 2023 1
Affiliations
Frequent co-authors
10from 10 papers