Cite
Notes
Only stored in your browser.
Attribution
Mimic In-Context Learning for Multimodal Tasks
CVPR 2025 1
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
arXiv 2025
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
from 3 papers
Xu Yang
Xin Geng
Xinting Hu
Chenduo Hao
Gongrui Zhang
Jiale Fu
Jie Liu
Kai Yang
Lu Qi
Miaosen Zhang