Xiaoteng Ma
7 papers
Authored papers
δ-mem: Efficient Online Memory for Large Language Models
arXiv 2026
Label Unbalance in High-frequency Trading
arXiv 2025
Where LLM Agents Fail and How They can Learn From Failures
arXiv 2025
SEABO: A Simple Search-Based Method for Offline Imitation Learning
arXiv 2024
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
arXiv 2023
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
arXiv 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers