Zhan Tong
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
CVPR 2023 1
Bootstrapping SparseFormers from Vision Foundation Models
CVPR 2024 1
Efficient Video Action Detection with Token Dropout and Context Refinement
ICCV 2023 1
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
arXiv 2023
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
videomae-masked-autoencoders-are-data
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
arXiv 2022
Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers