Hong Li
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Code as Agent Harness
arXiv 2026
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors
arXiv 2026
Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR
arXiv 2026
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation
arXiv 2025
Light of Normals: Unified Feature Representation for Universal Photometric Stereo
arXiv 2025
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
arXiv 2024
Boosting Multi-modal Model Performance with Adaptive Gradient Modulation
ICCV 2023 1
FNeVR: Neural Volume Rendering for Face Animation
arXiv 2022
Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements
EMNLP 2020 11
Affiliations
Frequent co-authors
10from 9 papers