Attribution
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer
arXiv 2025
Li Zhiyuan
Lin Yueyu
Liu Xiao