Zhi-Quan Luo
5 papers
Authored papers
Adam-mini: Use Fewer Learning Rates To Gain More
arXiv 2024
Why Transformers Need Adam: A Hessian Perspective
arXiv 2024
TeleQnA: A Benchmark Dataset to Assess Large Language Models Telecommunications Knowledge
arXiv 2023
Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks
ICCV 2023
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers