Monte Carlo Tree Search

Summary

Monte Carlo Tree Search is an iterative Reinforcement learning algorithm that finds optimal solutions in a highly multidimensional search space. It is a heuristic in that it does not require any knowledge beyond the “rules of the game”.

Details

The algorithm proceeds in four steps:

Selection: a child node is selected with the highest upper confidence bound applied to trees ( $U CT$ ):
$U CT = \frac{w _{i}}{n _{i}} + C \frac{ln N _{i}}{n _{i}}$
$w_{i}$ : Score, or number of wins from this node $n_{i}$ : Number of simulations from this node $N_{i}$ : Total number of simulations $C$ : A constant that is chosen empirically (usually set to $2$ )
Expansion: a new child node is added to the tree at this optimally reached point
Simulation (AKA rollout): the remainder of the process or game is played out
Backpropagation: updating score of all nodes to the top of the tree with the result

Figures

Ref geeksforgeeks.com

Quartz 4

Explorer

Monte Carlo Tree Search

Summary

Details

Figures

Graph View

Backlinks