Progress: Super Success!
After implementing the evaluation action choosing method and fixing up a minor bug that caused bad play (was looking for holes horizontally), I have created a much better agent. Doing a console trainer run, after 100 episodes, it had completed a total of 525647 movements. That’s 5256 movements per episode on average. If a piece [...]