Ravi Netravali
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
arXiv 2025
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
arXiv 2025
Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs
arXiv 2025
Marconi: Prefix Caching for the Era of Hybrid LLMs
arXiv 2024
Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers