Kun Ouyang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
Kimi-VL Technical Report
arXiv 2025
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
arXiv 2025
SpaceR: Reinforcing MLLMs in Video Spatial Reasoning
arXiv 2025
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?
arXiv 2025
Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence
arXiv 2025
TEMPLE:Temporal Preference Learning of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment
arXiv 2025
Sentiment-enhanced Graph-based Sarcasm Explanation in Dialogue
arXiv 2024
PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
arXiv 2024
Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers