Attribution
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play
arXiv 2026
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
CVPR 2025
from 2 papers
Xiaocheng Feng
Bing Qin
Bo Zheng
Da Chen
Deyi Yin
Ganqu Cui
Haoye Zhang
Jun Song
Lei Huang
Libo Qin