Cite
Notes
Only stored in your browser.
Attribution
EPO: Hierarchical LLM Agents with Environment Preference Optimization
arXiv 2024
Natural Language Reinforcement Learning
Meta-Learning Parameterized Skills
arXiv 2022
from 3 papers
George Konidaris
Bo Liu
researcher
Chen Sun
Girish A. Koushik
Jun Wang
Mengyue Yang
Michael Littman
Qi Zhao
Saket Tiwari
Shangqun Yu