Yushun Zhang

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Kimi K2.5: Visual Agentic Intelligence

arXiv 2026

Adam-mini: Use Fewer Learning Rates To Gain More

arXiv 2024

Why Transformers Need Adam: A Hessian Perspective

arXiv 2024

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

arXiv 2023

No known affiliations.

from 4 papers

Ruoyu Sun

Zhi-Quan Luo

Ziniu Li

Congliang Chen

Tian Ding

Aidi Li

Angang Du

Ao Wang

Bo Pang

Bohong Yin