Xiaohan Ding
- Papers: 10
Authored papers
Seed1.5-VL Technical Report
arXiv 2025
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations
arXiv 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
arXiv 2024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
arXiv 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
CVPR 2024
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
arXiv 2023
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
CVPR 2022
RepVGG: Making VGG-style ConvNets Great Again
CVPR 2021
RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition
arXiv 2021
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks