PufferAI / PufferLib
Simplifying reinforcement learning for complex game environments
☆2,143 · Updated this week
Alternatives and similar repositories for PufferLib
Users who are interested in PufferLib are comparing it to the libraries listed below.
- LeanRL is a fork of CleanRL in which selected PyTorch scripts are optimized for performance using compile and cudagraphs. ☆597 · Updated 8 months ago
- Mastering Diverse Domains through World Models ☆1,944 · Updated 2 months ago
- Really Fast End-to-End Jax RL Implementations ☆899 · Updated 9 months ago
- NanoGPT (124M) in 3 minutes ☆2,699 · Updated last week
- 🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆335 · Updated last week
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight. ☆319 · Updated last month
- Tutorials on tinygrad ☆385 · Updated last week
- The Fastest Deep Reinforcement Learning Library ☆821 · Updated last month
- RL Environments in JAX 🌍 ☆771 · Updated 3 weeks ago
- Distributed Reinforcement Learning accelerated by Lightning Fabric ☆380 · Updated this week
- Massively parallel rigidbody physics simulation on accelerator hardware. ☆2,727 · Updated last week
- Can you design a controller to steer a simulated car? ☆249 · Updated 5 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,796 · Updated this week
- (WIP) A small but powerful, homemade PyTorch from scratch. ☆554 · Updated this week
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools. ☆783 · Updated last week
- The Autograd Engine ☆616 · Updated 9 months ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research ☆535 · Updated 9 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆3,387 · Updated 7 months ago
- Multi-Agent Reinforcement Learning with JAX ☆599 · Updated 3 weeks ago
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆1,481 · Updated 5 months ago
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S… ☆1,826 · Updated 6 months ago
- System 2 Reasoning Link Collection ☆838 · Updated 3 months ago
- The Tensor (or Array) ☆436 · Updated 10 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆902 · Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,026 · Updated 3 weeks ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆555 · Updated 11 months ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆467 · Updated this week
- UNet diffusion model in pure CUDA ☆608 · Updated 11 months ago
- ♟️ Vectorized RL game environments in JAX ☆487 · Updated 3 months ago
- Cost aware hyperparameter tuning algorithm ☆158 · Updated last year