Attribution
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
Adam-mini: Use Fewer Learning Rates To Gain More
arXiv 2024
Why Transformers Need Adam: A Hessian Perspective
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
arXiv 2023
from 4 papers
Ruoyu Sun
Zhi-Quan Luo
Ziniu Li
Congliang Chen
Tian Ding
Aidi Li
Angang Du
Ao Wang
Bo Pang
Bohong Yin