Attribution
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation
arXiv 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
arXiv 2024
from 2 papers
Abdelhakim Benechehab
Albert Thomas
Balázs Kégl
Giuseppe Paolo
Maurizio Filippone
Ambroise Odonnat
Corentin Léger
Gabriel Singer
Ievgen Redko
Oussama Zekri