Cite
Notes
Only stored in your browser.
Attribution
Adam-mini: Use Fewer Learning Rates To Gain More
arXiv 2024
Hiding Data Helps: On the Benefits of Masking for Sparse Coding
arXiv 2023
Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup
arXiv 2022
from 3 papers
Muthu Chidambaram
Rong Ge
Congliang Chen
Diederik P. Kingma
Ruoyu Sun
Tian Ding
Xiang Wang
Yinyu Ye
Yu Cheng
Yushun Zhang