A series of macro and micro optimizations to make Puffer RL training faster and scale better.
Scaling RL: GPU multithreaded batching with Puffer
A series of macro and micro optimizations to make Puffer RL training faster and scale better.
Using a toy but playable pixel platformer to create agents that can play along-side you.
(Part 2/2) Understanding the internals of an LSTM by (de)constructing an 1-bit LSTM.
(Part 1/2) A creative coding experiment to explore LSTMs with your touch