Cite
Notes
Only stored in your browser.
Attribution
Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
arXiv 2026
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
arXiv 2025
Medical Dead-ends and Learning to Identify High-risk States and Treatments
NeurIPS 2021 12
from 3 papers
Eric P. Xing
Zhengzhong Liu
Abulhair Saparov
Chengqian Gao
Fan Zhou
Feng Yao
Haonan Li
Jayakumar Subramanian
Jianshu She
Jinyu Hou