Cite
Notes
Only stored in your browser.
Attribution
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
arXiv 2025
Secrets of RLHF in Large Language Models Part II: Reward Modeling
arXiv 2024
from 2 papers
Qi Zhang
Tao Gui
Tao Ji
Xipeng Qiu
Zhan Chen
Bin Guo
Binghai Wang
Caishuang Huang
Chenyu Shi
Enyu Zhou