Ending2015a / unstable_baselines
A TF2.0 implementation of RL baselines.
β10Updated 3 years ago
Alternatives and similar repositories for unstable_baselines:
Users that are interested in unstable_baselines are comparing it to the libraries listed below
- Implicit Normalizing Flows + Reinforcement Learningβ61Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationβ40Updated 5 months ago
- π§Ά Minimal PyTorch Soft Actor Critic (SAC) implementationβ38Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]β37Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policiesβ20Updated 3 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimationβ21Updated 6 years ago
- Generalised UDRLβ37Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.β33Updated 2 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was β¦β15Updated 2 years ago
- Revisiting Rainbowβ74Updated 3 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)β23Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQNβ45Updated 4 years ago
- π΄ OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)β24Updated 3 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithmβ45Updated 2 years ago
- Exploring whether DRQN + action prior + state-based expert + history-based entropy-reduction expertβ8Updated 4 years ago
- Collection of reinforcement learning algorithmsβ15Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationβ68Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"β44Updated 2 years ago
- AGAC: Adversarially Guided Actor-Criticβ48Updated 3 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RLβ43Updated 3 years ago
- OpenAi's gym environment wrapper to vectorize them with Rayβ22Updated last year
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learningβ49Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradientsβ32Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variablesβ71Updated 2 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"β30Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorchβ35Updated last month
- β42Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemysβ¦β21Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Predictionβ19Updated 5 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorchβ17Updated 2 years ago