Offline Imitation Learning with Variational Counterfactual Reasoning

In offline imitation learning (IL), an agent aims to learn an optimal expert behavior policy without additional online environment interactions. However, in many real-world scenarios, such as robotics manipulation, the offline dataset is collected from suboptimal behaviors…

Year: 2023
ArXiv: arxiv.org/abs/2310.04706
URL: arxiv.org/abs/2310.04706v4
Hosting: External source, license unknown

Attribution

Abstract & full text: arxiv.org/abs/2310.04706v4
TL;DR: Semantic Scholar
