Yilong Zhao

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

Flash-KMeans: Fast and Memory-Efficient Exact K-Means

arXiv 2026

2026

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

arXiv 2025

2025

FrontierCS: Evolving Challenges for Evolving Intelligence

arXiv 2025

2025

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

arXiv 2025

2025

XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models

arXiv 2024

2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv 2024

2024

NanoFlow: Towards Optimal Large Language Model Serving Throughput

arXiv 2024

2024

Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

arXiv 2024

2024

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Song Han

4 shared papers

Baris Kasikci

3 shared papers

Ion Stoica

professor / co-founder

Kan Zhu

Yujun Lin

Arvind Krishnamurthy

Chenfeng Xu

Chien-Yu Lin

Han Cai

Haocheng Xi