Thompson sampling is a strategy to explore a space while exploiting the wins. In this video we see an application to winning at a game of one-armed bandits.
Beta distributions video: • The Beta distribution ...
Tom Denton blog: inventingsitua...
Icons made by Freepik from www.flaticon.com
Announcement: Book by Luis Serrano! Grokking Machine Learning. bit.ly/grokkingML
40% discount code: serranoyt
Негізгі бет Thompson sampling, one armed bandits, and the Beta distribution
Пікірлер: 26