Sainbayar Sukhbaatar
- Papers
- 9
Cite
Notes
Only stored in your browser.
9papers
Authored papers
9Multi-Token Attention
arXiv 2025
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
arXiv 2025
Training Large Language Models to Reason in a Continuous Latent Space
arXiv 2024
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
arXiv 2024
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
arXiv 2024
Learning to Reason and Memorize with Self-Notes
learning-to-reason-and-memorize-with-self
A Data Source for Reasoning Embodied Agents
arXiv 2023
Memory-Augmented Reinforcement Learning for Image-Goal Navigation
arXiv 2021
End-To-End Memory Networks
end-to-end-memory-networks-1
Affiliations
No known affiliations.
Frequent co-authors
10from 9 papers