Yan Zhou

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

arXiv 2025

2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

arXiv 2025

2025

Can Multimodal Large Language Models Understand Spatial Relations?

arXiv 2025

2025

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

arXiv 2025

2025

AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models

arXiv 2025

2025

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

arXiv 2024

2024

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

arXiv 2024

2024

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation

daspeech-directed-acyclic-transformer-for

2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Qingkai Fang

Yang Feng

Shaolei Zhang

Shoutao Guo

Bin Xia

Haiyun Jiang

Jiaya Jia

Jiehui Huang

Jinchuan Zhang

Jingping Liu