Sadegh Mahdavi
3 papers
Authored papers
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation
arXiv 2025
Memorization Capacity of Multi-Head Attention in Transformers
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 3 papers