Attribution
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
RULER: What's the Real Context Size of Your Long-Context Language Models?
arXiv 2024
from 2 papers
Boris Ginsburg
Cheng-Ping Hsieh
Dima Rekesh
Fei Jia
Shantanu Acharya
Simeng Sun
Aaron Blakeman
Aaron Grattafiori
Aarti Basant
Abhibha Gupta