STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning

Centralized Training with Decentralized Execution (CTDE) has proven to be an effective paradigm in cooperative multi-agent reinforcement learning (MARL). One of its major challenges is credit assignment, which aims to credit each agent according to its contribution.

Year: 2023
ArXiv: arxiv.org/abs/2304.07520
URL: arxiv.org/abs/2304.07520v2
Hosting: External source; license unknown

Attribution

Abstract & full text: arxiv.org/abs/2304.07520v2
TL;DR: Semantic Scholar
