Weilin Zhao

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

The Flexibility Trap: Rethinking the Value of Arbitrary Order in Diffusion Language Models

arXiv 2026

2026

MiniCPM4: Ultra-Efficient LLMs on End Devices

arXiv 2025

2025

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

arXiv 2025

2025

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

arXiv 2025

2025

BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

arXiv 2025

2025

Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding

arXiv 2024

2024

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

arXiv 2024

2024

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

arXiv 2024

2024

Tool Learning with Foundation Models

arXiv 2023

2023

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models

arXiv 2023

2023

OpenPrompt: An Open-source Framework for Prompt-learning

ACL 2022 5

2021

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Maosong Sun

professor

10 shared papers

Zhiyuan Liu

professor

Xu Han

Chaojun Xiao

Yuxiang Huang

Ning Ding

researcher

4 shared papers

Shengding Hu

researcher

Ao Sun

Kaihuo Zhang

YuXuan Li