Bo An
- Papers: 15
Authored papers (15)
AgentOrchestra: Orchestrating Multi-Agent Intelligence with the Tool-Environment-Agent (TEA) Protocol
arXiv 2025
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
arXiv 2026
AgentOCR: Reimagining Agent History via Optical Self-Compression
arXiv 2026
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
arXiv 2025
Group-in-Group Policy Optimization for LLM Agent Training
arXiv 2025
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
arXiv 2025
Skywork Open Reasoner 1 Technical Report
arXiv 2025
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies
arXiv 2025
LLM×MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources
arXiv 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
arXiv 2025
Cradle: Empowering Foundation Agents Towards General Computer Control
arXiv 2024
MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts
arXiv 2024
Dual-Head Knowledge Distillation: Enhancing Logits Utilization with an Auxiliary Head
arXiv 2024
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading
arXiv 2024
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence
arXiv 2023
Frequent co-authors
10 from 15 papers