>_ Train RL Hunter
DQN · runs in browser · saves to game automatically
← back to escape grid
// status
state idle
episode0 / 8000
win rate (last 200)
avg reward
epsilon1.000
replay buffer0 / 60000
// controls
episodes 8000
speed (ep/tick) 3
✓ weights saved — H: RL ready in game
// hyperparams
network62→128→128→14
optimizerAdam lr=1e-3
gamma0.95
batch64
ε decay×0.9995/ep
target syncevery 300 steps
participant AI70% greedy
// win rate over time
// training log