Yiwen Tang
- Papers: 10
Authored papers
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
arXiv 2026
Hume: Introducing System-2 Thinking in Visual-Language-Action Model
arXiv 2025
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
arXiv 2025
AutoMat: Enabling Automated Crystal Structure Reconstruction from Microscopy via Agentic Tool Use
arXiv 2025
Exploring the Potential of Encoder-free Architectures in 3D LMMs
arXiv 2025
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
arXiv 2024
ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance
arXiv 2023
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
arXiv 2023
Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
arXiv 2023
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
ICCV 2023