Scalarization based Pareto optimal set of arms identification algorithms

  • M.M. Drugan
  • , A. Nowe

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

17 Citations (Scopus)

Abstract

Multi-objective multi-armed bandits (MOMAB) is an extension of the multi-objective multi-armed bandits framework that considers reward vectors instead of scalar reward values. Scalarization functions transform the reward vectors into reward values in order to use the standard multi-armed bandits (MAB) algorithms. However for many applications it is not obvious to come up with a good scalarization set and therefore there is needed to develop MAB that discover the whole Pareto set of arms. Our approach to this multi-objective MAB problem is two folded: i) identify the set of Pareto optimal arms and ii) identify the minimum subset of scalarization functions that optimize the set of Pareto optimal arms. We experimentally compare the proposed MOMAB algorithms on a multi-objective Bernoulli problem.

Original languageEnglish
Title of host publicationProceedings of the International Joint Conference on Neural Networks
Place of PublicationPiscataway
PublisherInstitute of Electrical and Electronics Engineers
Pages2690-2697
Number of pages8
ISBN (Electronic)978-1-4799-1484-5
DOIs
Publication statusPublished - 3 Sept 2014
Externally publishedYes
Event2014 International Joint Conference on Neural Networks, IJCNN 2014 - Beijing International Convention Center, Beijing, China
Duration: 6 Jul 201411 Jul 2014
http://www.ieee-wcci2014.org

Conference

Conference2014 International Joint Conference on Neural Networks, IJCNN 2014
Abbreviated titleIJCNN 2014
Country/TerritoryChina
CityBeijing
Period6/07/1411/07/14
OtherInternational Joint Conference on Neural Networks
Internet address

Fingerprint

Dive into the research topics of 'Scalarization based Pareto optimal set of arms identification algorithms'. Together they form a unique fingerprint.

Cite this