What is it about?
This paper proposes an algorithm that combines multiple strategies for the stochastic multi-armed bandit problem. The experiments of Auer et al. (2002) show that bandit algorithms perform differently on problems with different reward distributions. When it is not known in advance which strategy will perform best, our algorithm offers one solution: it adaptively combines the candidate strategies.
Why is it important?
Theoretically, the proposed algorithm, epsilon_t-comb, converges asymptotically to the best strategy; the paper gives a precise definition of what "best strategy" means.
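To make the idea concrete, here is a minimal sketch of one plausible reading of such a meta-algorithm: each candidate strategy is treated as a "meta-arm", and the combiner picks among them with a decaying exploration rate eps_t = min(1, c/t). The base strategies (epsilon-greedy and UCB1), the constant c, and the choice to let every base strategy observe the shared play history are illustrative assumptions, not the paper's exact construction.

```python
import math
import random


class EpsilonGreedy:
    """Base strategy: classic epsilon-greedy arm selection."""

    def __init__(self, n_arms, eps=0.1):
        self.eps = eps
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms

    def select(self, rng):
        if rng.random() < self.eps:
            return rng.randrange(len(self.counts))
        return max(range(len(self.counts)), key=lambda a: self.values[a])

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


class UCB1:
    """Base strategy: UCB1 in the style of Auer et al. (2002)."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms
        self.t = 0  # counts this strategy's own selections

    def select(self, rng):
        self.t += 1
        for a, c in enumerate(self.counts):
            if c == 0:  # play each untried arm once
                return a
        return max(
            range(len(self.counts)),
            key=lambda a: self.values[a]
            + math.sqrt(2.0 * math.log(self.t) / self.counts[a]),
        )

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def run_meta(probs, horizon, c=5.0, seed=0):
    """Run a hypothetical epsilon_t-style combiner on a Bernoulli bandit.

    At round t it explores a random base strategy with probability
    eps_t = min(1, c/t), otherwise exploits the strategy with the best
    empirical mean reward so far.
    """
    rng = random.Random(seed)
    n_arms = len(probs)
    strategies = [EpsilonGreedy(n_arms), UCB1(n_arms)]
    meta_counts = [0] * len(strategies)
    meta_values = [0.0] * len(strategies)
    arm_pulls = [0] * n_arms
    total = 0.0
    for t in range(1, horizon + 1):
        eps_t = min(1.0, c / t)
        if rng.random() < eps_t:
            s = rng.randrange(len(strategies))
        else:
            s = max(range(len(strategies)), key=lambda i: meta_values[i])
        arm = strategies[s].select(rng)
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        # Assumption: every base strategy observes the shared (arm, reward)
        # pair, so all of them keep learning even when not chosen.
        for strat in strategies:
            strat.update(arm, reward)
        meta_counts[s] += 1
        meta_values[s] += (reward - meta_values[s]) / meta_counts[s]
        arm_pulls[arm] += 1
        total += reward
    return arm_pulls, total


if __name__ == "__main__":
    pulls, total = run_meta([0.2, 0.5, 0.8], horizon=2000, seed=42)
    print("arm pulls:", pulls, "total reward:", total)
```

Because eps_t decays, the combiner explores both base strategies early on but spends almost all later rounds with whichever one has earned the higher average reward, which is the intuition behind the asymptotic-convergence claim.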
Read the Original
This page is a summary of: Combining Multiple Strategies for Multiarmed Bandit Problems and Asymptotic Optimality, Journal of Control Science and Engineering, January 2015, Hindawi Publishing Corporation, DOI: 10.1155/2015/264953.