Honglin Guo
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
arXiv 2026
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction
arXiv 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
arXiv 2025
Pre-Trained Policy Discriminators are General Reward Models
arXiv 2025
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
arXiv 2025
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset
arXiv 2025
TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models
ICCV 2025
CritiQ: Mining Data Quality Criteria from Human Preferences
arXiv 2025
Better Process Supervision with Bi-directional Rewarding Signals
arXiv 2025
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
arXiv 2025
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
arXiv 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
arXiv 2024
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers