Lei Yang

Papers
33

Attribution: affiliations and profile from Semantic Scholar

Authored papers

33

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

arXiv 2026

2026

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

arXiv 2026

2026

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

arXiv 2026

2026

A Very Big Video Reasoning Suite

arXiv 2026

2026

Demystifying Video Reasoning

arXiv 2026

2026

EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents

arXiv 2026

2026

EgoLife: Towards Egocentric Life Assistant

CVPR 2025 1

2025

ConsistCompose: Unified Multimodal Layout Control for Image Composition

arXiv 2025

2025

Scaling Spatial Intelligence with Multimodal Foundation Models

arXiv 2025

2025

Step-Audio 2 Technical Report

arXiv 2025

2025

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

arXiv 2025

2025

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

arXiv 2025

2025

TokensGen: Harnessing Condensed Tokens for Long Video Generation

ICCV 2025

2025

ProBench: Benchmarking Large Language Models in Competitive Programming

arXiv 2025

2025

SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation

arXiv 2025

2025

MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers

arXiv 2024

2024

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

arXiv 2024

2024

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

arXiv 2024

2024

Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed Improvement of Utility-Privacy Trade-off

arXiv 2024

2024

FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data

arXiv 2024

2024

AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation

CVPR 2024 1

2024

WHAC: World-grounded Humans and Cameras

arXiv 2024

2024

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting

CVPR 2024 1

2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

ICCV 2023 1

2023

Contrastive Deep Nonnegative Matrix Factorization for Community Detection

arXiv 2023

2023

SHERF: Generalizable Human NeRF from a Single Image

ICCV 2023 1

2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

arXiv 2023

2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model

ICCV 2023 1

2023

ReliTalk: Relightable Talking Portrait Generation from a Single Video

arXiv 2023

2023

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction

ICCV 2023 1

2023

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

arXiv 2022

2022

Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation

CVPR 2022 1

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 33 papers