Shubham Toshniwal
- Papers: 10
Authored papers
Llama-Nemotron: Efficient Reasoning Models
arXiv 2025
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset
arXiv 2025
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
arXiv 2024
Nemotron-4 340B Technical Report
arXiv 2024
Major Entity Identification: A Generalizable Alternative to Coreference Resolution
arXiv 2024
Learning to Reason and Memorize with Self-Notes
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Chess as a Testbed for Language Model State Tracking
arXiv 2021
On Generalization in Coreference Resolution
CRAC (ACL) 2021
Frequent co-authors
10 from 10 papers
Igor Gitman
Ivan Moshkov
Sean Narenthiran
Aleksander Ficek
Ameya Sunil Mahabaleshwarkar
Boris Ginsburg
Bryan Catanzaro
Dan Su
Daria Gitman
Deepak Narayanan