Attribution
Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models
arXiv 2024
Abhimanyu Bambhaniya
Geonhwa Jeong
Midhilesh Elavazhagan
Ritik Raj
Souvik Kundu
Sudarshan Srinivasan
Suvinay Subramanian
Tushar Krishna