Inverse Reinforcement Learning With Constraint Recovery

In this work, we propose a novel inverse reinforcement learning (IRL) algorithm for constrained Markov decision process (CMDP) problems. In standard IRL problems, the inverse learner or agent seeks to recover the reward function of the MDP, given a set of trajectory…

Open

Year: 2023
ArXiv: arxiv.org/abs/2305.08130
URL: arxiv.org/abs/2305.08130v1
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2305.08130v1
TL;DR: Semantic Scholar

Attribution policy →