Siddharth Singh
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
Gemstones: A Model Suite for Multi-Faceted Scaling Laws
arXiv 2025
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
arXiv 2024
Loki: Low-rank Keys for Efficient Sparse Attention
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers