Cite
Notes
Only stored in your browser.
Attribution
Adaptation of Agentic AI
arXiv 2025
Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
from 3 papers
Jiacheng Lin
Kun Qian
Changran Hu
Chao Zhang
Chaoqi Yang
Dawn Song
professor
Dylan Zhang
Ge Li
Hanwen Xu
Hao Peng