Cite
Notes
Only stored in your browser.
Attribution
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
arXiv 2026
TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs
arXiv 2025
from 2 papers
Ming-Ming Cheng
Qibin Hou
Yunheng Li
Hengrui Zhang
Jiangxia Cao
Jing Cheng
Shaohui Jiao
Shaoyong Jia
Zhaojie Liu