Daoan Zhang
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist
arXiv 2025
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
arXiv 2025
Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs
arXiv 2025
On Path to Multimodal Generalist: General-Level and General-Bench
arXiv 2025
SePPO: Semi-Policy Preference Optimization for Diffusion Alignment
arXiv 2024
GaussianStyle: Gaussian Head Avatar via StyleGAN
arXiv 2024
GPT-4V(ision) as A Social Media Analysis Engine
arXiv 2023
Cross Contrasting Feature Perturbation for Domain Generalization
ICCV 2023 1
Video Understanding with Large Language Models: A Survey
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers