Zining Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning
arXiv 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025 1
HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers