Stochastic bandits for multi-platform budget optimization in online advertising

Vashist Avadhanula, Riccardo Colini-Baldeschi, Stefano Leonardi, Karthik Abinav Sankararaman, Okke Schrijvers

2021

We study the problem of an online advertising system that wants to optimally spend an advertiser’s given budget for a campaign across multiple platforms, without knowing the value of showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over T rounds of bidding, with the set of arms given by the set of distinct bidding m-tuples, where m is the number of platforms. We modify the algorithm proposed by Badanidiyuru et al. [11], extending it to the case of multiple platforms, to obtain an algorithm for both discrete and continuous bid spaces. Namely, for discrete bid spaces we give an algorithm with regret , where OPT is the performance of the optimal algorithm that knows the distributions. For continuous bid spaces the regret of our algorithm is . When restricted to this special case, this bound improves over Sankararaman and Slivkins [34] in the regime OPT
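To make the arm set in the discrete case concrete, here is a minimal sketch of how the bidding m-tuples could be enumerated. It is an illustration only, not the paper's algorithm: the function name `bid_tuples` and the example bid values are hypothetical, and it assumes every platform shares the same finite list of candidate bids.

```python
from itertools import product


def bid_tuples(bids, m):
    """Enumerate every arm: one bid per platform across m platforms.

    Assumption (not from the source): all platforms share the same
    finite list of candidate bids.
    """
    return list(product(bids, repeat=m))


# Hypothetical example values: 3 candidate bids, 2 platforms.
bids = [0.5, 1.0, 2.0]
arms = bid_tuples(bids, m=2)

print(len(arms))  # 3**2 = 9 arms
print(arms[0])    # (0.5, 0.5): bid 0.5 on both platforms
```

With n candidate bids per platform, this naive enumeration yields n^m arms, so the arm set grows exponentially in the number of platforms.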
