Ranking In Generalized Linear Bandits

We study the ranking problem in generalized linear bandits. At each time, the learning agent selects an ordered list of items and observes stochastic outcomes. In recommendation systems, displaying an ordered list of the most attractive items is not always optimal as both…

Open

Year: 2022
ArXiv: arxiv.org/abs/2207.00109
URL: arxiv.org/abs/2207.00109v2
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2207.00109v2
TL;DR: Semantic Scholar

Attribution policy →