Yuxiang Huang

Papers: 8

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

MiniCPM4: Ultra-Efficient LLMs on End Devices

arXiv 2025

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

arXiv 2025

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

arXiv 2025

NOSA: Native and Offloadable Sparse Attention

arXiv 2025

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

arXiv 2025

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices

arXiv 2024

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding

arXiv 2024

Tool Learning with Foundation Models

arXiv 2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Xu Han

8 shared papers

Zhiyuan Liu

professor

8 shared papers

Chaojun Xiao

6 shared papers

Maosong Sun

professor

6 shared papers

Weilin Zhao

5 shared papers

Ganqu Cui

researcher

3 shared papers

Jie Zhou

3 shared papers

Kaihuo Zhang

3 shared papers

Ning Ding

researcher

3 shared papers

YuXuan Li

3 shared papers