Small revisions to LAO* and the epsilon-greedy policy

Written on 29.01.2024 09:06 by Joerg Hoffmann

Hi all,

This is just to draw your attention to small changes in the LAO* pseudo-code (Chapter 14) and the epsilon-greedy policy (Chapter 16):

  • The LAO* pseudo-code previously allowed re-insertion of already expanded states into Frontier, which is wrong.
  • The epsilon-greedy policy did not correctly handle the case where there is only one applicable action; also, in the random-exploration case, all actions are now considered, including the one taken by the policy.
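
For orientation, here is a minimal Python sketch of the two corrected behaviours. It is not the pseudo-code from the slides: the function names (`epsilon_greedy`, `add_to_frontier`), the parameters, and the list/set representation of actions, Frontier, and expanded states are all assumptions made for illustration. Please refer to Chapters 14 and 16 for the actual definitions.

```python
import random


def epsilon_greedy(applicable, greedy_action, epsilon, rng=random):
    """Pick an action: explore with probability epsilon, else follow the policy.

    `applicable` is the list of applicable actions, `greedy_action` the action
    the current policy would take (hypothetical names, not from the slides).
    """
    if not applicable:
        raise ValueError("no applicable action in this state")
    # Single applicable action: nothing to explore, so return it directly.
    if len(applicable) == 1:
        return applicable[0]
    # Exploration: sample uniformly from ALL applicable actions,
    # including the one the policy itself would take.
    if rng.random() < epsilon:
        return rng.choice(applicable)
    # Exploitation: follow the policy.
    return greedy_action


def add_to_frontier(frontier, expanded, successors):
    """Append successors to the frontier, skipping already expanded states.

    Assumes `frontier` is a list and `expanded` a set of states.
    """
    for state in successors:
        if state not in expanded:
            frontier.append(state)
```

With `epsilon = 0` the first function always returns `greedy_action`, and with a single applicable action it returns that action regardless of `epsilon`. The second function only illustrates the guard against re-inserting expanded states, not LAO* as a whole.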

Sorry for the inconvenience. I thought it better to correct these definitions post hoc, after the lectures, than to leave the bugs in.


Jörg Hoffmann
