Ranking In Generalized Linear Bandits
We study the ranking problem in generalized linear bandits. At each time, the learning agent selects an ordered list of items and observes stochastic outcomes. In recommendation systems, displaying an ordered list of the most attractive items is not always optimal as both…
- Year
- 2022
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.