Yi Wu
- Papers: 17
Authored papers
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
arXiv 2025
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
arXiv 2025
Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective
arXiv 2025
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions
arXiv 2025
Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL
arXiv 2025
ReaL: Efficient RLHF Training of Large Language Models with Parameter Reallocation
arXiv 2024
SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models
arXiv 2024
Offline Reinforcement Learning for LLM Multi-Step Reasoning
arXiv 2024
LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination
arXiv 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
arXiv 2023
How Effective Are Neural Networks for Fixing Security Vulnerabilities
arXiv 2023
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
arXiv 2023
SOAR: Scene-debiasing Open-set Action Recognition
Efficient Backdoor Attacks for Deep Neural Networks in Real-world Scenarios
arXiv 2023
E^2TAD: An Energy-Efficient Tracking-based Action Detector
arXiv 2022
Emergent Tool Use From Multi-Agent Autocurricula
ICLR 2020
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Frequent co-authors
10 from 17 papers