Hang Hua
- Papers: 14
Authored papers
Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?
arXiv 2026
Aurora: Unified Video Editing with a Tool-Using Agent
arXiv 2026
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
arXiv 2025
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
arXiv 2025
Latent Chain-of-Thought for Visual Reasoning
arXiv 2025
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
arXiv 2025
Generative AI for Cel-Animation: A Survey
arXiv 2025
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
arXiv 2025
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
arXiv 2025
PromptFix: You Prompt and We Fix the Photo
arXiv 2024
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CVPR 2025
GaussianStyle: Gaussian Head Avatar via StyleGAN
arXiv 2024
VideoXum: Cross-modal Visual and Textural Summarization of Videos
arXiv 2023
PromptCap: Prompt-Guided Task-Aware Image Captioning
arXiv 2022
Affiliations
Frequent co-authors
10 from 14 papers