Cite
Notes
Only stored in your browser.
Attribution
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
arXiv 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
from 2 papers
Boyuan Wang
Chang Liu
Jun Xu
Xiangyang Ji
XianTong Zhen
Yingdong Shi
Yixiu Mao
Yuchen Yang
Yuhang Jiang
Yun Qu