Heung-Yeung Shum
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
arXiv 2026
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
arXiv 2025
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
arXiv 2025
Step-Audio 2 Technical Report
arXiv 2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
arXiv 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
arXiv 2025
Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts
arXiv 2024
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph
arXiv 2023
HumanTOMATO: Text-aligned Whole-body Motion Generation
arXiv 2023
Locally Attentional SDF Diffusion for Controllable 3D Shape Generation
arXiv 2023
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
dino-detr-with-improved-denoising-anchor
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
mask-dino-towards-a-unified-transformer-based
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis
CVPR 2023 1
Affiliations
Frequent co-authors
10from 15 papers