Zizheng Pan
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
arXiv 2024
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
arXiv 2024
Stitchable Neural Networks
CVPR 2023 1
Stitched ViTs are Flexible Vision Backbones
arXiv 2023
Fast Vision Transformers with HiLo Attention
arXiv 2022
EcoFormer: Energy-Saving Attention with Linear Complexity
arXiv 2022
Mesa: A Memory-saving Training Framework for Transformers
arXiv 2021
Scalable Vision Transformers with Hierarchical Pooling
ICCV 2021 10
Affiliations
Frequent co-authors
10from 11 papers