Attribution
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
arXiv 2025
Edoardo Fadda
Emilio Del-Moral-Hernandez
Leonardo Kanashiro Felizardo
Mariá Cristina Vasconcelos Nascimento