PufferAI / PufferLibLinks
Simplifying reinforcement learning for complex game environments
☆2,763Updated this week
Alternatives and similar repositories for PufferLib
Users that are interested in PufferLib are comparing it to the libraries listed below
Sorting:
- LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.☆603Updated 8 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆555Updated last week
- Tutorials on tinygrad☆393Updated last month
- NanoGPT (124M) in 3 minutes☆2,811Updated this week
- Really Fast End-to-End Jax RL Implementations☆908Updated 10 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆3,396Updated 8 months ago
- If tinygrad wasn't small enough for you...☆724Updated last year
- ☆408Updated last week
- Can you design a controller to steer a simulated car?☆258Updated 6 months ago
- Frontier Models playing the board game Diplomacy.☆522Updated this week
- 🕹️ A diverse suite of scalable reinforcement learning environments in JAX☆739Updated last month
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,067Updated 3 months ago
- Distributed Training Over-The-Internet☆946Updated 2 months ago
- Multi-Agent Reinforcement Learning with JAX☆606Updated last week
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL☆343Updated this week
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary h…☆794Updated this week
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆151Updated 2 weeks ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,495Updated 6 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,479Updated 3 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆326Updated last week
- The n-gram Language Model☆1,433Updated 11 months ago
- ☆154Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,176Updated this week
- System 2 Reasoning Link Collection☆844Updated 4 months ago
- Cost aware hyperparameter tuning algorithm☆162Updated last year
- The Tensor (or Array)☆437Updated 11 months ago
- High performance hybrid classical-quantum computing learning framework written in C☆445Updated 5 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,804Updated 3 weeks ago
- The Autograd Engine☆621Updated 10 months ago
- RL for total beginners☆93Updated last month