Cite
Notes
Only stored in your browser.
Attribution
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
arXiv 2023
Regret-Minimizing Double Oracle for Extensive-Form Games
from 2 papers
Jun Wang
Le Cong Dinh
Muning Wen
Weinan Zhang
Xiaohang Tang
Xidong Feng
Yaodong Yang
professor
Ying Wen
Ziyu Wan