Cite
Notes
Only stored in your browser.
Attribution
Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models
arXiv 2024
from 1 papers
Abhimanyu Bambhaniya
Geonhwa Jeong
Madhu Kumar
Midhilesh Elavazhagan
Souvik Kundu
Sudarshan Srinivasan
Suvinay Subramanian
Tushar Krishna