ShiMin Li
- Papers: 10
Authored papers (10)
MOSS-TTS Technical Report
arXiv 2026
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models
arXiv 2026
MOVA: Towards Scalable and Synchronized Video-Audio Generation
arXiv 2026
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
arXiv 2025
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems
arXiv 2025
SpeechAlign: Aligning Speech Generation to Human Preferences
arXiv 2024
Case2Code: Learning Inductive Reasoning with Synthetic Data
arXiv 2024
Cross-Modality Safety Alignment
arXiv 2024
Can AI Assistants Know What They Don't Know?
arXiv 2024
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models
arXiv 2023
Frequent co-authors
10 from 10 papers