Junkai Zhang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
arXiv 2025
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
arXiv 2025
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization
arXiv 2025
WebSailor: Navigating Super-human Reasoning for Web Agent
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers