Zhaoye Fei
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10MOSS-TTS Technical Report
arXiv 2026
World Action Models: The Next Frontier in Embodied AI
arXiv 2026
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
arXiv 2026
MOVA: Towards Scalable and Synchronized Video-Audio Generation
arXiv 2026
VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search
arXiv 2025
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
arXiv 2025
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models
arXiv 2025
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems
arXiv 2025
RoboOmni: Proactive Robot Manipulation in Omni-modal Context
arXiv 2025
Balanced Data Sampling for Language Model Training with Clustering
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers