Cite
Notes
Only stored in your browser.
Attribution
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
arXiv 2025
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
arXiv 2024
Woodpecker: Hallucination Correction for Multimodal Large Language Models
arXiv 2023
from 3 papers
Enhong Chen
Chaoyou Fu
Shukang Yin
Tong Xu
Xing Sun
Yunhang Shen
Caifeng Shan
Chunjiang Ge
Dianbo Sui
Fangtai Wu