Jiahao Li
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Qwen-Image-VAE-2.0 Technical Report
arXiv 2026
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
arXiv 2026
Efficient Autoregressive Video Diffusion with Dummy Head
arXiv 2026
Qwen-Image Technical Report
arXiv 2025
Seed1.5-VL Technical Report
arXiv 2025
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition
arXiv 2025
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
arXiv 2025
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
arXiv 2025
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
arXiv 2025
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
arXiv 2025
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback
arXiv 2024
Neural Video Compression with Feature Modulation
CVPR 2024 1
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
arXiv 2024
JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery
ICCV 2023 1
Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity
arXiv 2022
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
CVPR 2023 1
Affiliations
Frequent co-authors
10from 16 papers