News
Small revisions to LAO* and the epsilon-greedy policy
Written on 29.01.2024 09:06 by Joerg Hoffmann
Hi all,
this is just to draw your attention to small changes in the LAO* pseudo-code (Chapter 14) and the epsilon-greedy policy (Chapter 16):
- The LAO* pseudo-code previously allowed re-insertion of already expanded states into Frontier, which is wrong.
- The epsilon-greedy policy did not correctly handle the case where there is only one applicable action; also, in the random-exploration case, all actions are considered including the one taken by the policy.
Sorry for this inconvenience. I thought it better to correct these definitions posthoc, after the lectures, than to leave the bugs in.
best,
Jörg Hoffmann