Show HN: Watch a neural net learn to play Snake
ppo.gradexp.xyz
In-browser PPO training demo, made possible by tinygrad: TinyJit -> WebGPU kernels.
Requires WebGPU.
Cool project!
I noticed that if you switch from training to watch and then back, the training score temporarily drops significantly.
Could you share more details and implementation notes, please?