Yuanxin Liu
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
arXiv 2026
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
Kimi-VL Technical Report
arXiv 2025
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
arXiv 2025
SpaceR: Reinforcing MLLMs in Video Spatial Reasoning
arXiv 2025
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?
arXiv 2025
RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction
arXiv 2025
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence
arXiv 2025
TEMPLE:Temporal Preference Learning of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment
arXiv 2025
UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?
arXiv 2025
TempCompass: Do Video LLMs Really Understand Videos?
arXiv 2024
Temporal Reasoning Transfer from Text to Video
arXiv 2024
PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
arXiv 2024
VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models
arXiv 2023
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
arXiv 2022
Affiliations
Frequent co-authors
10from 16 papers