Huatong Song
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training
arXiv 2026
LLM-in-Sandbox Elicits General Agentic Intelligence
arXiv 2026
SWE-World: Building Software Engineering Agents in Docker-Free Environments
arXiv 2026
BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?
arXiv 2026
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
arXiv 2025
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis
arXiv 2025
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
arXiv 2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
arXiv 2025
YuLan-Mini: An Open Data-efficient Language Model
arXiv 2024
Affiliations
Frequent co-authors
10from 9 papers