Attribution
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play
arXiv 2026
SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution
from 2 papers
Bing Qin
Lei Huang
Libo Qin
Lingpeng Kong
Weitao Ma
Xiachong Feng
Xiaocheng Feng
Yangfan Ye
Yi Jiang
Yuxuan Gu