adysonmaia / sb3-plusLinks
Additional DRL algorithms to the StableBaselines3 lib
☆18Updated last year
Alternatives and similar repositories for sb3-plus
Users that are interested in sb3-plus are comparing it to the libraries listed below
Sorting:
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆80Updated last year
- Partially Observable Process Gym☆203Updated 4 months ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆89Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆151Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆155Updated last year
- ☆45Updated 2 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 3 months ago
- The Starcraft Multi-Agent challenge lite☆41Updated last year
- Fast and flexible multi-agent gridworld reinforcement learning environments.☆44Updated 7 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 3 years ago
- IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL☆44Updated 2 weeks ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆53Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆68Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆76Updated 2 years ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆81Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆173Updated 11 months ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆39Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆193Updated last year
- ☆49Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆104Updated last year
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆111Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆91Updated 11 months ago
- Benchmarking RL generalization in an interpretable way.☆166Updated last week
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Repo for Implicit Diffusion Q-Learning☆116Updated last year
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆218Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆61Updated last month
- JAX implementation of RL algorithms and vectorized environments☆49Updated last year
- OpenAi's gym environment wrapper to vectorize them with Ray☆23Updated 2 years ago