Cite
Notes
Only stored in your browser.
Attribution
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
arXiv 2025
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge
arXiv 2023
from 2 papers
Hanjing Wang
Jun Wang
Kai Zhang
Linyi Yang
Mark Schmidt
Ruilong Dan
Shuyue Hu
Steve Jiang
Weinan Zhang
Xiaoyu Wen