Attribution
Inference-time Alignment in Continuous Space
arXiv 2025
Incentivizing Reasoning from Weak Supervision
CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL
Robust Recommender System: A Survey and Future Directions
arXiv 2023
from 4 papers
Bingbing Xu
Bolin Ding
HuaWei Shen
Teng Xiao
Xueqi Cheng
Yige Yuan
Anni Zou
Cheng Chen
Fei Sun
Jinyang Gao