MetcalfeTom / stable-baselines3-GPU
A GPU-accelerated fork of stable-baselines, delivering reliable implementations of reinforcement learning algorithms.
☆23 Updated 4 years ago
Alternatives and similar repositories for stable-baselines3-GPU:
Users who are interested in stable-baselines3-GPU are comparing it to the libraries listed below:
- ☆20 Updated 9 months ago
- ☆31 Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆17 Updated 5 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆111 Updated 3 years ago
- ☆74 Updated last week
- Foundation Policies with Hilbert Representations (ICML 2024) ☆80 Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆53 Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆34 Updated 2 weeks ago
- Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning" ☆14 Updated 9 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 Updated 2 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆38 Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- Synchronized Curriculum Learning for RL Agents ☆41 Updated 2 weeks ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆40 Updated last week
- Scalable Opponent Shaping Experiments in JAX ☆24 Updated 11 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Pytorch implementation of DreamerV2: Mastering Atari with Discrete World Models, based on the original implementation ☆19 Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations ☆74 Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Fast reinforcement learning research ☆57 Updated 3 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite ☆36 Updated 2 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354 ☆25 Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 Updated 7 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆68 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆85 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆53 Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆70 Updated last year
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023 ☆45 Updated 11 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆55 Updated 5 months ago