Huayu Chen
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
arXiv 2025
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
arXiv 2025
Process Reinforcement through Implicit Rewards
arXiv 2025
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
direct-discriminative-optimization-your
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
Visual Generation Without Guidance
arXiv 2025
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
arXiv 2024
Free Process Rewards without Process Labels
arXiv 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
arXiv 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
arXiv 2024
Score Regularized Policy Optimization through Diffusion Behavior
arXiv 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers