Heung-Yeung Shum

Papers: 15

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

arXiv 2026

2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

arXiv 2026

2026

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

arXiv 2025

2025

Step-Audio 2 Technical Report

arXiv 2025

2025

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

arXiv 2025

2025

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

arXiv 2025

2025

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

arXiv 2025

2025

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model

arXiv 2025

2025

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

arXiv 2024

2024

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph

arXiv 2023

2023

HumanTOMATO: Text-aligned Whole-body Motion Generation

arXiv 2023

2023

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation

arXiv 2023

2023

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

2022

Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis

CVPR 2023

2022

Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation

2022

Affiliations

No known affiliations.

Frequent co-authors

from 15 papers

Daxin Jiang

founder

Xiangyu Zhang

Binxing Jiao

Brian Li

Changyi Wan

Guanzhe Huang

Kang An

Wei Ji

Wen Sun

Yibo Zhu