0

Qi Tian

Papers
32

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
32papers

Authored papers

32

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

arXiv 2026

2026

LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding

arXiv 2026

2026

MagCache: Fast Video Generation with Magnitude-Aware Cache

arXiv 2025

2025

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

arXiv 2025

2025

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

arXiv 2025

2025

PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards

arXiv 2025

2025

HunyuanVideo 1.5 Technical Report

arXiv 2025

2025

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

arXiv 2025

2025

EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

arXiv 2025

2025

HunyuanImage 3.0 Technical Report

arXiv 2025

2025

GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting

arXiv 2024

2024

Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

arXiv 2024

2024

HunyuanVideo: A Systematic Framework For Large Video Generative Models

arXiv 2024

2024

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

arXiv 2024

2024

Towards 3D Molecule-Text Interpretation in Language Models

arXiv 2024

2024

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

CVPR 2024 1

2023

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

CVPR 2024 1

2023

ControlVideo: Training-free Controllable Text-to-Video Generation

arXiv 2023

2023

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval

arXiv 2023

2023

QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models

arXiv 2023

2023

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

arXiv 2023

2023

Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions

arXiv 2023

2023

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation

ICCV 2023 1

2023

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data

CVPR 2024 1

2023

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

arXiv 2022

2022

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration

CVPR 2023 1

2022

DocScanner: Robust Document Image Rectification with Progressive Learning

arXiv 2021

2021

Visformer: The Vision-friendly Transformer

ICCV 2021 10

2021

Rectifying the Shortcut Learning of Background for Few-Shot Learning

NeurIPS 2021 12

2021

Large-Scale Spatio-Temporal Person Re-identification: Algorithms and Benchmark

arXiv 2021

2021

GhostNet: More Features from Cheap Operations

ghostnet-more-features-from-cheap-operations-1

2019

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

ICLR 2020 1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 32 papers