Yewei Fang

Attribution

3 papers

Authored papers

MiniCPM4: Ultra-Efficient LLMs on End Devices

arXiv 2025

Tokenization Falling Short: On Subword Robustness in Large Language Models

arXiv 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding

arXiv 2024

No known affiliations.

Co-authors from 3 papers

Chaojun Xiao

Kaihuo Zhang

Maosong Sun

professor

Weilin Zhao

Xu Han

Yuxiang Huang

Zhiyuan Liu

professor

Ao Sun

Bingxiang He

Biyuan Lin