Ming Ding
- Papers: 19
Authored papers (19)
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
arXiv 2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
arXiv 2024
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
arXiv 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
arXiv 2024
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
arXiv 2024
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations
arXiv 2024
LVBench: An Extreme Long Video Understanding Benchmark
ICCV 2025
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
GPT Can Solve Mathematical Problems Without a Calculator
arXiv 2023
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
GLM-130B: An Open Bilingual Pre-trained Model
arXiv 2022
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers
arXiv 2022
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
ACL 2022
CogView: Mastering Text-to-Image Generation via Transformers
NeurIPS 2021
GPT Understands, Too
arXiv 2021
GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training
arXiv 2020
Affiliations
Frequent co-authors
10 from 19 papers