Attribution
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
arXiv 2025
from 1 paper
Jun Wang
Linyi Yang
Mark Schmidt
Shuyue Hu
Weinan Zhang
Xiaoyu Wen
Yan Song
Ying Wen
Yunxiang Li
Ziyu Wan