PufferAI / PufferLib
Simplifying reinforcement learning for complex game environments
☆4,574Updated last week
Alternatives and similar repositories for PufferLib
Users that are interested in PufferLib are comparing it to the libraries listed below
- Entropy Based Sampling and Parallel CoT Decoding☆3,429Updated last year
- Tutorials on tinygrad☆444Updated 2 months ago
- If tinygrad wasn't small enough for you...☆759Updated last year
- Can you design a controller to steer a simulated car?☆329Updated 4 months ago
- NanoGPT (124M) in 3 minutes☆3,922Updated last week
- Frontier Models playing the board game Diplomacy.☆604Updated 3 weeks ago
- Environments for LLM Reinforcement Learning☆3,603Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,687Updated 7 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆661Updated this week
- Some ipython notebooks implementing AI algorithms☆1,391Updated 6 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,830Updated 5 months ago
- Thermodynamic Hypergraphical Model Library in JAX☆952Updated 3 weeks ago
- LeanRL is a fork of CleanRL in which selected PyTorch scripts are optimized for performance using compile and cudagraphs.☆657Updated 3 months ago
- The Fastest Deep Reinforcement Learning Library☆907Updated 2 weeks ago
- Monte Carlo tree search in JAX☆2,568Updated 3 months ago
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆786Updated last week
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction" by Sutton and Barto, along with various RL research …☆207Updated 3 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆762Updated this week
- ☆5,964Updated last week
- ☆532Updated 4 months ago
- Really Fast End-to-End Jax RL Implementations☆997Updated last year
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,262Updated last month
- Distributed Training Over-The-Internet☆967Updated last month
- RL Environments in JAX 🌍☆839Updated 6 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆1,207Updated 2 months ago
- Textbook on reinforcement learning from human feedback☆1,344Updated 2 weeks ago
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S…☆1,922Updated last year
- Solve puzzles to improve your tinygrad skills!☆164Updated last month
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,073Updated 2 months ago
- System 2 Reasoning Link Collection☆861Updated 8 months ago