Cite
Notes
Only stored in your browser.
Attribution
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
In-context Learning and Gradient Descent Revisited
arXiv 2023
from 2 papers
Aaron Blakeman
Aaron Grattafiori
Aarti Basant
Abhibha Gupta
Abhinav Khattar
Adi Renduchintala
Aditya Vavre
Akanksha Shukla
Akhiad Bercovich
Aleksander Ficek