Zhefeng Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Taming the Titans: A Survey of Efficient LLM Inference Serving
arXiv 2025
OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure
arXiv 2024
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
arXiv 2024
Mirror: A Universal Framework for Various Information Extraction Tasks
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers