0

Kai Liu

Papers
18

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
18papers

Authored papers

18

Audio-Visual Intelligence in Large Foundation Models

arXiv 2026

2026

JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

arXiv 2026

2026

PlanViz: Evaluating Planning-Oriented Image Generation and Editing for Computer-Use Tasks

arXiv 2026

2026

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

arXiv 2025

2025

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

arXiv 2025

2025

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

arXiv 2025

2025

Seed1.5-VL Technical Report

arXiv 2025

2025

Low-bit Model Quantization for Deep Neural Networks: A Survey

arXiv 2025

2025

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

arXiv 2025

2025

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

arXiv 2025

2025

A Modular Dataset to Demonstrate LLM Abstraction Capability

arXiv 2025

2025

Structure-aware Domain Knowledge Injection for Large Language Models

arXiv 2024

2024

INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection

arXiv 2024

2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

arXiv 2024

2024

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

arXiv 2024

2024

Parallel Speculative Decoding with Adaptive Draft Length

arXiv 2024

2024

Uncertainty-aware Unsupervised Multi-Object Tracking

ICCV 2023 1

2023

Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

arXiv 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 18 papers