Cite
Notes
Only stored in your browser.
Attribution
MiniCPM4: Ultra-Efficient LLMs on End Devices
arXiv 2025
Tokenization Falling Short: On Subword Robustness in Large Language Models
arXiv 2024
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding
from 3 papers
Chaojun Xiao
Kaihuo Zhang
Maosong Sun
professor
Weilin Zhao
Xu Han
Yuxiang Huang
Zhiyuan Liu
Ao Sun
Bingxiang He
Biyuan Lin