Attribution
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
arXiv 2024
Lifting the Curse of Capacity Gap in Distilling Language Models
arXiv 2023
from 2 papers
Jingang Wang
Bei Li
Benyou Wang
Chen Zhang
Danlong Yuan
Dawei Song
Dongyan Zhao
Huishuai Zhang
Xunliang Cai
Yang Yang