Ying Sheng

Stanford CS PhD; co-founder and lead author of SGLang, FlexGen, and S-LoRA; previously co-led xAI's inference team.

Role: researcher
Currently at: LMSYS Org
Twitter: twitter.com/ying11231
GitHub: github.com/Ying1123
Scholar: scholar.google.com/citations
Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

11papers

Authored papers

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

ICML

2024

Post-Training Sparse Attention with Double Sparsity

arXiv 2024

2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

arXiv 2024

2024

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

NeurIPS

2023

Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90% ChatGPT Quality

blog

2023

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

arXiv 2023

2023

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU

arXiv 2023

2023

SGLang: Efficient Execution of Structured Language Model Programs

arXiv 2023

2023

Efficient Memory Management for Large Language Model Serving with PagedAttention

arXiv 2023

2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

arXiv 2023

2023

On Optimal Caching and Model Multiplexing for Large Model Inference

arXiv 2023

2023

Affiliations

Currently at

LMSYS Org

researcher · research group

Previously

xAIfrontier lab Stanford Universityuniversity lab University of California, Berkeleyuniversity lab

Frequent co-authors

from 11 papers

Lianmin Zheng

grad-student

10 shared papers

Ion Stoica

professor / co-founder

8 shared papers

Joseph E. Gonzalez

5 shared papers

Clark Barrett

4 shared papers

Dacheng Li

grad-student

4 shared papers

Hao Zhang

professor

4 shared papers

Zhuohan Li

researcher

4 shared papers

Banghua Zhu

professor

3 shared papers

Joseph E. Gonzalez

professor

3 shared papers

Siyuan Zhuang

researcher

3 shared papers