Yuanxin Liu

Papers: 16

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

arXiv 2026

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

arXiv 2026

Kimi K2.5: Visual Agentic Intelligence

arXiv 2026

Kimi-VL Technical Report

arXiv 2025

TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

arXiv 2025

SpaceR: Reinforcing MLLMs in Video Spatial Reasoning

arXiv 2025

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

arXiv 2025

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction

arXiv 2025

TEMPLE: Temporal Preference Learning of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment

arXiv 2025

Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence

arXiv 2025

UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

arXiv 2025

TempCompass: Do Video LLMs Really Understand Videos?

arXiv 2024

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

arXiv 2024

Temporal Reasoning Transfer from Text to Video

arXiv 2024

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

arXiv 2023

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

arXiv 2022

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Xu Sun

Kun Ouyang

Lei Li

Linli Yao

Shuhuai Ren

Lingpeng Kong

Qi Liu

Shicheng Li

Yi Liu

Haoning Wu