Zhengkai Jiang

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

arXiv 2026

2026

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

arXiv 2025

2025

HunyuanImage 3.0 Technical Report

arXiv 2025

2025

Efficient Multimodal Large Language Models: A Survey

arXiv 2024

2024

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

arXiv 2024

2024

UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models

arXiv 2024

2024

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

arXiv 2024

2024

Personalize Segment Anything Model with One Shot

arXiv 2023

2023

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

arXiv 2023

2023

Rethinking Mobile Block for Efficient Attention-based Models

ICCV 2023 1

2023

You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Peng Gao

Hongsheng Li

Hao Dong

Siyuan Huang

Chengjie Wang

Hangting Chen

Jian Li

Jian Yang

Lucas Wang

Shi-Xue Zhang