Cite
Notes
Only stored in your browser.
Attribution
WebGames: Challenging General-Purpose Web-Browsing AI Agents
arXiv 2025
LM2: Large Memory Models
Discovering Preference Optimization Algorithms with and for Large Language Models
arXiv 2024
Dense Reward for Free in Reinforcement Learning from Human Feedback
from 4 papers
Andy Toulis
Filippos Christianos
Fraser Greenlee
George Thomas
Jikun Kang
Marvin Purtorab
Mihaela van der Schaar
Samuel Holt
Wenqi Wu
Chris Lu