Attribution
T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning
arXiv 2026
AstraFlow: Dataflow-Oriented Reinforcement Learning for Agentic LLMs
On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures
arXiv 2023
from 3 papers
Z. Morley Mao
Beidi Chen
Chenwei Zhang
Haixin Wang
Haizhong Zheng
Hejie Cui
Ion Stoica
professor / co-founder
Jiachen Sun
Jiahui Wang
Jiawei Zhao