Qingkai Fang
Authored papers (9)
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
arXiv 2025
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
arXiv 2025
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
arXiv 2025
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
arXiv 2025
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
arXiv 2024
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
arXiv 2024
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment
arXiv 2024
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
arXiv 2023
Frequent co-authors
10 from 9 papers