Audio Interaction Model
3 Jun 2026
Audio is an inherently interactive modality, yet today's Large Audio Language Models (LALMs) are offline, and streaming audio models each handle only a single task such as streaming ASR or voice chatting.
Trending research and the full catalog - each paper linked to the benchmarks, methods, and models it introduces.
3 Jun 2026
Audio is an inherently interactive modality, yet today's Large Audio Language Models (LALMs) are offline, and streaming audio models each handle only a single task such as streaming ASR or voice chatting.