Linli Yao
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
arXiv 2026
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents
arXiv 2026
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
arXiv 2025
RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction
arXiv 2025
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence
arXiv 2025
AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration
arXiv 2025
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
arXiv 2024
Temporal Reasoning Transfer from Text to Video
arXiv 2024
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding
CVPR 2024 1
Rethinking Benchmarks for Cross-modal Image-text Retrieval
arXiv 2023
Affiliations
Frequent co-authors
10from 10 papers