Kai Liu
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18Audio-Visual Intelligence in Large Foundation Models
arXiv 2026
JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation
arXiv 2026
PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks
arXiv 2026
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
arXiv 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
arXiv 2025
Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
arXiv 2025
Seed1.5-VL Technical Report
arXiv 2025
Low-bit Model Quantization for Deep Neural Networks: A Survey
arXiv 2025
JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization
arXiv 2025
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
arXiv 2025
A Modular Dataset to Demonstrate LLM Abstraction Capability
arXiv 2025
Structure-aware Domain Knowledge Injection for Large Language Models
arXiv 2024
INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
arXiv 2024
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
arXiv 2024
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
arXiv 2024
Parallel Speculative Decoding with Adaptive Draft Length
arXiv 2024
Uncertainty-aware Unsupervised Multi-Object Tracking
ICCV 2023 1
Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech
arXiv 2020
Affiliations
Frequent co-authors
10from 18 papers