Multi-armed Bandits (MAB) problem is used to get the balance of exploration and exploitation. I want to utilize this technique to solve my problem. I want to know where I can find a good performance source code about MAB. Thanks.

asked Jul 20 '10 at 01:56

charlie's gravatar image

charlie
140121417


One Answer:

If you're just looking at the basic bandit problem the algorithms described in Finite-time analysis of the multiarmed bandit problem are probably the ones you want. Honestly these algorithms are trivial to implement. Any halfway decent programmer should be able to code them up in a few hours at most.

answered Jul 20 '10 at 02:35

Noel%20Welsh's gravatar image

Noel Welsh
72631023

Your answer
toggle preview

Subscription:

Once you sign in you will be able to subscribe for any updates here

Tags:

×2

Asked: Jul 20 '10 at 01:56

Seen: 1,518 times

Last updated: Jul 20 '10 at 02:35

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.