Attribution
AXLearn: Modular Large Model Training on Heterogeneous Infrastructure (arXiv 2025)
Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution (arXiv 2024)
Punica: Multi-Tenant LoRA Serving (arXiv 2023)
from 3 papers
Arvind Krishnamurthy
BoWen Zhang
Chang Lan
Cheng Leong
Chung-Cheng Chiu
David Qiu
Dongseong Hwang
Floris Weers
Guoli Yin
Hanzhi Zhou