Maze Problem with the reinforcement learning method proposed by
Murakoshi and Mizuno(2004)(help[japanese])
First, make an agent learn a route to the goal.
Secondly, choice emergency 0 or 1 as a new wall.
This method appropriately controls three learning parameters.
(Compare with the conventional method.)
(If this program is not executed, please install Java VM on here or here.)
Copyright(c) Ryoji Ino and Kazushi Murakoshi. All rights reserved.
Based on Hajime Kimura.
Last Modified: 27 December, 2007