Attribution
Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning
arXiv 2025
Eugene Ie
Shenao Zhang
Tianqi Liu
Yaqing Wang
Yinxiao Liu
Yunxuan Li
Zhaoran Wang