PufferAI / PufferLib
Simplifying reinforcement learning for complex game environments
☆4,930 Updated this week
Alternatives and similar repositories for PufferLib
Users who are interested in PufferLib are comparing it to the libraries listed below.
- NanoGPT (124M) in 2 minutes ☆4,410 Updated last week
- Entropy Based Sampling and Parallel CoT Decoding ☆3,436 Updated last year
- Thermodynamic Hypergraphical Model Library in JAX ☆992 Updated 2 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆672 Updated this week
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,720 Updated last month
- Tutorials on tinygrad ☆453 Updated 3 months ago
- Implementation for MatMul-free LM. ☆3,051 Updated last month
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,754 Updated 9 months ago
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research … ☆206 Updated 5 months ago
- Our library for RL environments + evals ☆3,774 Updated this week
- If tinygrad wasn't small enough for you... ☆768 Updated last year
- Solve puzzles to improve your tinygrad skills! ☆176 Updated 3 months ago
- Really Fast End-to-End Jax RL Implementations ☆1,013 Updated last year
- Frontier Models playing the board game Diplomacy. ☆624 Updated last month
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,855 Updated 7 months ago
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX ☆800 Updated last month
- Repository for the Lux AI Challenge, season 3 @NeurIPS 24. Hosted on @kaggle ☆325 Updated 11 months ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆2,033 Updated 5 months ago
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta. ☆2,971 Updated 2 weeks ago
- ☆540 Updated 5 months ago
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S… ☆1,952 Updated last year
- The Fastest Deep Reinforcement Learning Library ☆923 Updated last week
- Can you design a controller to steer a simulated car? ☆342 Updated 6 months ago
- LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆668 Updated 5 months ago
- Implementation of all RL algorithms in a simpler way ☆1,383 Updated 5 months ago
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h… ☆866 Updated last week
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,222 Updated 4 months ago
- Monte Carlo tree search in JAX ☆2,584 Updated 4 months ago
- ☆909 Updated last week
- 🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆384 Updated 3 months ago