0

A Model-Free Universal AI

In general reinforcement learning, all established optimal agents, including AIXI, are model-based, explicitly maintaining and using environment models. This paper introduces Universal AI with Q-Induction (AIQI), the first model-free agent proven to be asymptotically…

Preview
Year
2026
Hosting
Full text hostedCC-BY-4.0

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2602.23242CC-BY-4.0
TL;DR
Semantic Scholar
Attribution policy →

Abstract

In general reinforcement learning, all established optimal agents, including AIXI, are model-based, explicitly maintaining and using environment models. This paper introduces Universal AI with Q-Induction (AIQI), the first model-free agent proven to be asymptotically \varepsilon-optimal in general RL. AIQI performs universal induction over distributional action-value functions, instead of policies or environments like previous works. Under a grain of truth condition, we prove that AIQI is strong asymptotically \varepsilon-optimal and asymptotically \varepsilon-Bayes-optimal. We also apply our novel proof techniques to show asymptotic \varepsilon-optimality of Self-AIXI without any ad-hoc assumptions. Our results significantly expand the diversity of known universal agents.