Cite
Notes
Only stored in your browser.
Attribution
PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold
arXiv 2025
Pearl: A Production-ready Reinforcement Learning Agent
arXiv 2023
from 2 papers
Zheqing Zhu
Alex Nikulkov
Daniel Jiang
Dmytro Korenkevych
Frank Cheng
Hongbo Guo
Jalaj Bhandari
Jinsong Liu
Jiuqi Wang
Liam Li