Mostofa Patwary
7 papers
Authored papers
Llama-Nemotron: Efficient Reasoning Models
arXiv 2025
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
Pretraining Large Language Models with NVFP4
arXiv 2025
RLP: Reinforcement as a Pretraining Objective
arXiv 2025
Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset
arXiv 2024
Nemotron-4 340B Technical Report
arXiv 2024
Compact Language Models via Pruning and Knowledge Distillation
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers
Bryan Catanzaro
Mohammad Shoeybi
Dan Su
Deepak Narayanan
Joseph Jennings
Sanjeev Satheesh
Shrimai Prabhumoye
Aleksander Ficek
Ameya Sunil Mahabaleshwarkar
Boris Ginsburg