Yi Wan

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold

arXiv 2025

Pearl: A Production-ready Reinforcement Learning Agent

arXiv 2023

No known affiliations.

from 2 papers

Zheqing Zhu

Alex Nikulkov

Daniel Jiang

Dmytro Korenkevych

Frank Cheng

Hongbo Guo

Jalaj Bhandari

Jinsong Liu

Jiuqi Wang

Liam Li