Xin Dong
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
arXiv 2026
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
arXiv 2025
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
arXiv 2025
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement
arXiv 2025
Hymba: A Hybrid-head Architecture for Small Language Models
arXiv 2024
DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers