Dawei Leng
6 papers
Authored papers
FG-CLIP: Fine-Grained Visual and Textual Alignment
arXiv 2025
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
arXiv 2025
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
arXiv 2024
Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task
arXiv 2024
IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities
arXiv 2024
CCMB: A Large-scale Chinese Cross-modal Benchmark
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers