Zhengkai Jiang
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis
arXiv 2026
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
arXiv 2025
HunyuanImage 3.0 Technical Report
arXiv 2025
Efficient Multimodal Large Language Models: A Survey
arXiv 2024
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
arXiv 2024
ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
arXiv 2024
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
arXiv 2024
Personalize Segment Anything Model with One Shot
arXiv 2023
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
arXiv 2023
Rethinking Mobile Block for Efficient Attention-based Models
ICCV 2023 1
You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction
arXiv 2022
Affiliations
Frequent co-authors
10from 11 papers