Progress: Overnight Run Report
Yesterday I left an agent running on default settings (20 trials, 0.99 cooling rate) to try and find the ideal agent for playing standard Tetris (parameter 0). Here are my results:
Using best parameter: Mults: {1, 1, 2, 38, 8}, worth 0.53121482649264, over 1091020 steps.
Policy:
Mults: {1, 1, 2, 38, 8}, worth 0.53121482649264, over 1091020 steps.
Mults: {1, [...]