Haoyang Huang
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
arXiv 2026
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models
arXiv 2026
CodeTracer: Towards Traceable Agent States
arXiv 2026
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence
arXiv 2026
Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation
arXiv 2026
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
arXiv 2026
OmniForcing: Unleashing Real-time Joint Audio-Visual Generation
arXiv 2026
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
arXiv 2025
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models
arXiv 2024
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References
arXiv 2023
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models
arXiv 2023
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
arXiv 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
arXiv 2022
UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation
arXiv 2020
Affiliations
Frequent co-authors
10from 15 papers