Attribution
Natural Language Reinforcement Learning
arXiv 2024
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
arXiv 2023
ChessGPT: Bridging Policy Learning and Language Modeling
from 3 papers
Jun Wang
Mengyue Yang
Ying Wen
Ziyu Wan
Bo Liu
researcher
David Mguni
Girish A. Koushik
Haotian Fu
Hongrui Tang
Kun Shao