Yumeng Li
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Multimodal OCR: Parse Anything from Documents
arXiv 2026
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model
arXiv 2025
PVChat: Personalized Video Chat with One-Shot Learning
ICCV 2025
Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection
arXiv 2025
Slow Perception: Let's Perceive Geometric Figures Step-by-step
arXiv 2024
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
arXiv 2024
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive
arXiv 2024
Divide & Bind Your Attention for Improved Generative Semantic Nursing
arXiv 2023
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers