Razvan Pascanu
- Papers: 11
Authored papers (11)
From Markov to Laplace: How Mamba In-Context Learns Markov Chains
arXiv 2025
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
arXiv 2024
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
arXiv 2024
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
arXiv 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
arXiv 2024
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis
arXiv 2024
TRecViT: A Recurrent Video Transformer
arXiv 2024
Discovering modular solutions that generalize compositionally
arXiv 2023
Promoting Exploration in Memory-Augmented Adam using Critical Momenta
arXiv 2023
NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
arXiv 2022
Model compression via distillation and quantization
Affiliations
Frequent co-authors
10 from 11 papers