
ME-MCTS: Online Generalization by Combining Multiple Value Estimators

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Abstract

This paper addresses the challenge of online generalization in tree search. We propose Multiple Estimator Monte Carlo Tree Search (ME-MCTS), with a two-fold contribution. First, we introduce a formalization of online generalization that can represent existing techniques such as "history heuristics", "RAVE", or "OMA" -- contextual action value estimators or abstractors that generalize across specific contexts. Second, we incorporate recent advances in estimator averaging that enable guiding search by combining the online action value estimates of any number of such abstractors or similar types of action value estimators. Unlike previous work, which usually proposed a single abstractor for either the selection or the rollout phase of MCTS simulations, our approach focuses on the combination of multiple estimators and applies them to all move choices in MCTS simulations. As the MCTS tree itself is just another value estimator -- unbiased, but without abstraction -- this blurs the traditional distinction between action choices inside and outside of the MCTS tree. Experiments with three abstractors in four board games show significant improvements of ME-MCTS over MCTS using only a single abstractor, both for MCTS with random rollouts and for MCTS with static evaluation functions. While we used deterministic, fully observable games, ME-MCTS naturally extends to more challenging settings.
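
As a rough illustration of what "combining the online action value estimates" of several estimators could look like, the sketch below averages per-action means with inverse mean-squared-error weights. This is not the averaging scheme from the paper, which the abstract does not specify; the `Estimate` class, the `bias_sq` constants, the fixed `variance`, and the rule of trying unseen actions first are all assumptions made for the example. The tree estimator is modeled as unbiased (`bias_sq = 0`) with few samples, and an abstractor as biased with many samples.

```python
from dataclasses import dataclass
from typing import Dict, Hashable, List, Optional


@dataclass
class Estimate:
    """Running statistics one estimator holds for a single action (hypothetical)."""
    mean: float     # average return observed for this action
    count: int      # number of samples behind the mean
    bias_sq: float  # assumed squared bias of the estimator (0.0 for the tree itself)


def combine(estimates: List[Estimate], variance: float = 0.25) -> Optional[float]:
    """Inverse-MSE weighted average of the estimators' means.

    Each estimator's error is approximated as bias_sq + variance / count.
    Estimators without samples are skipped; returns None if none has data.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for e in estimates:
        if e.count == 0:
            continue
        mse = e.bias_sq + variance / e.count
        weight = 1.0 / mse
        weighted_sum += weight * e.mean
        weight_total += weight
    return weighted_sum / weight_total if weight_total > 0 else None


def select_action(per_action: Dict[Hashable, List[Estimate]]) -> Hashable:
    """Greedy choice over combined values; actions without data are tried first.

    No exploration bonus is included, to keep the sketch short.
    """
    def score(action: Hashable) -> float:
        value = combine(per_action[action])
        return float("inf") if value is None else value

    return max(per_action, key=score)
```

With these assumed weights, the biased abstractor dominates while the tree has few samples, and the unbiased tree estimate takes over as its sample count grows, because its error term `variance / count` shrinks toward zero while the abstractor's stays at or above `bias_sq`.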
Original language: English
Title of host publication: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
Publisher: International Joint Conferences on Artificial Intelligence (IJCAI)
Pages: 4032-4038
Number of pages: 7
Publication status: Published - 2021
Externally published: Yes
Event: 30th International Joint Conference on Artificial Intelligence, IJCAI 2021 - Virtual/Online, Montreal, Canada
Duration: 19 Aug 2021 to 26 Aug 2021
Conference number: 30

Conference

Conference: 30th International Joint Conference on Artificial Intelligence, IJCAI 2021
Abbreviated title: IJCAI 2021
Country/Territory: Canada
City: Montreal
Period: 19/08/21 to 26/08/21
