0

Qinglin Lu

Papers
22

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
22papers

Authored papers

22

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

arXiv 2026

2026

Hierarchical SVG Tokenization: Learning Compact Visual Programs for Scalable Vector Graphics Modeling

arXiv 2026

2026

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

arXiv 2026

2026

SOAR: Self-Correction for Optimal Alignment and Refinement in Diffusion Models

arXiv 2026

2026

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

arXiv 2026

2026

EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation

arXiv 2026

2026

Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

arXiv 2026

2026

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

arXiv 2026

2026

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

arXiv 2025

2025

HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters

arXiv 2025

2025

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

ICCV 2025

2025

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

arXiv 2025

2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

arXiv 2025

2025

Video Generation Models Are Good Latent Reward Models

arXiv 2025

2025

Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy

arXiv 2025

2025

HunyuanImage 3.0 Technical Report

arXiv 2025

2025

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

arXiv 2025

2025

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

arXiv 2025

2025

HunyuanVideo: A Systematic Framework For Large Video Generative Models

arXiv 2024

2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

arXiv 2024

2024

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

arXiv 2024

2024

SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 22 papers