Gabriele Oliaro
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
arXiv 2025
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
arXiv 2024
Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
arXiv 2024
Direct Telemetry Access
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers