Shuang Ma

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

arXiv 2024

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

arXiv 2024

TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

arXiv 2023

Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

ICCV 2023 1

No known affiliations.

from 4 papers

Ruijie Zheng

Furong Huang

Hal Daumé III

Huazhe Xu

Xiyao Wang

Yanchao Sun

Ashish Kapoor

Bernhard Aumayer

Felix Bai

Feng Nan