A series of macro and micro optimizations to make Puffer RL training faster and scale better.
Scaling RL: GPU multithreaded batching with Puffer
Hello! I am a software engineer. I am building a mindfulness game/app.
I am also exploring and exploiting reinforcement learning for fun and serious learning.
You can read my blog below where I write about my learnings in reinforcement learning.A series of macro and micro optimizations to make Puffer RL training faster and scale better.
Using a toy but playable pixel platformer to create agents that can play along-side you.
(Part 2/2) Understanding the internals of an LSTM by (de)constructing an 1-bit LSTM.
(Part 1/2) A creative coding experiment to explore LSTMs with your touch