Qingkai Fang

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

arXiv 2025

2025

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

arXiv 2025

2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

arXiv 2025

2025

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

arXiv 2025

2025

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning

arXiv 2024

2024

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

arXiv 2024

2024

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

arXiv 2024

2024

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation

daspeech-directed-acyclic-transformer-for

2023

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Yang Feng

Shaolei Zhang

Shoutao Guo

Yan Zhou

Zhengrui Ma

Bingquan Xia

Bowen Shen

Bowen Ye

Can Cai

Chenhong He