Battle of bandits: online learning from subsetwise preferences and other structured feedback
by Aadirupa Saha; advised by Chiranjib Bhattacharyya and Aditya Gopalan
- Bengaluru IISc 2021
- xxi, 448p.
include bibliographical reference and index
PhD; IISc; 2021
Bandit Algorithms Contextual Multiarmed Bandits Online Sequential Learning From Preferences