We introduce the Llama-Nemotron series of models, an open family of heterogeneous reasoning models that combine exceptional reasoning capabilities and inference efficiency with an open license for enterprise use. The family comes in three sizes -- Nano (8B), Super (49B), and Ultra (253B) -- and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior inference throughput and memory efficiency. In this report, we discuss the training procedure for these models, which entails neural architecture search starting from Llama 3 models for accelerated inference, knowledge distillation, and continued pretraining, followed by a reasoning-focused post-training stage consisting of two main parts: supervised fine-tuning and large-scale reinforcement learning. Llama-Nemotron models are the first open-source models to support a dynamic reasoning toggle, allowing users to switch between standard chat and reasoning modes during inference. To further support open research and facilitate model development, we provide the following resources:
1. We release the Llama-Nemotron reasoning models -- LN-Nano, LN-Super, and LN-Ultra -- under the commercially permissive NVIDIA Open Model License Agreement.
2. We release the complete post-training dataset: Llama-Nemotron-Post-Training-Dataset.
3. We also release our training codebases: NeMo, NeMo-Aligner, and Megatron-LM.
Llama-Nemotron: Efficient Reasoning Models
- Year
- 2025
- Venue
- arXiv 2025
- Authors
- 133
Attribution
- Abstract & full text
- arxiv.org/abs/2505.00949v3
- TL;DR
- Semantic Scholar
Authors
Bryan Catanzaro, Gerald Shen, Jiaqi Zeng, Jimmy Zhang, Oleksii Kuchaiev, Olivier Delalleau, Zhilin Wang, Ran Zilberstein, Yonatan Geifman, Roger Waleffe, Brandon Norick, Deepak Narayanan, Leon Derczynski, Erick Galinkin, Akhiad Bercovich, Itay Levy, Izik Golan, Mohammad Dabbah, Ran El-Yaniv, Omri Puny, Ido Galil, Zach Moshe, Tomer Ronen, Najeeb Nabwani, Ido Shahaf, Oren Tropp, Ehud Karpas, Soumye Singhal, Alexander Bukharin, Yian Zhang, Tugrul Konuk, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Zijia Chen, David Mosallanezhad, Adi Renduchintala, Haifeng Qian, Dima Rekesh, Fei Jia, Somshubra Majumdar, Vahid Noroozi, Wasi Uddin Ahmad, Sean Narenthiran, Aleksander Ficek, Mehrzad Samadi, Jocelyn Huang, Siddhartha Jain, Igor Gitman, Ivan Moshkov, Wei Du, Shubham Toshniwal, George Armstrong, Branislav Kisacanin, Matvei Novikov, Daria Gitman, Evelina Bakhturina, Jane Polak Scowcroft, John Kamalu, Dan Su, Kezhi Kong, Markus Kliegl, Rabeeh Karimi, Ying Lin, Sanjeev Satheesh, Jupinder Parmar, Pritam Gundecha, Joseph Jennings, Shrimai Prabhumoye, Syeda Nahida Akter, Mostofa Patwary, Abhinav Khattar, Bor-Yiing Su, Guyue Huang, Terry Kong, Parth Chadha, Sahil Jain, Christine Harvey, Elad Segal, Jining Huang, Sergey Kashirsky, Robert McQueen, Izzy Putterman, George Lam, Arun Venkatesan, Sherry Wu, Vinh Nguyen, Manoj Kilaru, Andrew Wang, Anna Warno, Abhilash Somasamudramath, Sandip Bhaskar, Maka Dong, Nave Assaf, Shahar Mor, Omer Ullman Argov, Scot Junkin, Oleksandr Romanenko, Pedro Larroy, Marco Rovinelli, Viji Balas, Nicholas Edelman, Anahita Bhiwandiwalla, Muthu Subramaniam, Smita Ithape, Karthik Ramamoorthy, Yuting Wu, Suguna Varshini Velury, Omri Almog, Joyjit Daw, Denys Fridman, Michael Evans, Shaona Ghosh, Katherine Luna, Nikki Pope, Eileen Long, Seth Schneider, Guillermo Siman, Tomasz Grzegorzek, Pablo Ribalta, Monika Katariya, Chris Alexiuk, Joey Conway, Trisha Saar, Ann Guan, Krzysztof Pawelec, Shyamala Prayaga, Boris Ginsburg, Oluwatobi Olabiyi, Kari Briski, Jonathan Cohen, Jonah Alben, Eric Chung