I currently writing my “one-shot environments” section of the thesis and rerun old experiments.
(Maybe a bug causes this but) my “Deletion” agent performs worst than the Vanilla Agent in the Sinus environments.
The problem is that the “Deletion” agent start to focus “too” fast on local maxima instead of looking for the global maxima.
I changed the C in the UCT formula to various values but the result over 1000 runs stays always the same.
I will discuss those results with Kurt during our meeting today.