Shuang Ma
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
arXiv 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
arXiv 2024
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
arXiv 2023
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training
ICCV 2023 1
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers