ngc92 / space-wrappers
General-purpose environment wrappers for OpenAI Gym
☆24 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for space-wrappers
- Recurrent continuous reinforcement learning algorithms implemented in PyTorch. ☆50 · Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆66 · Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆45 · Updated last year
- Combining Evolutionary Algorithms and deep RL in various ways ☆99 · Updated 4 years ago
- Deep RL agents with PyTorch ☆35 · Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 · Updated 3 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control" ☆21 · Updated 4 years ago
- A TensorFlow implementation of the Option-Critic Architecture ☆70 · Updated 7 years ago
- Hindsight Experience Replay - Bit flipping experiment in TensorFlow ☆58 · Updated 6 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51 ☆23 · Updated 5 years ago
- ☆81 · Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning ☆18 · Updated 7 years ago
- Episodic Control ☆19 · Updated 2 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning ☆94 · Updated 4 years ago
- ☆97 · Updated last year
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository ☆45 · Updated 4 years ago
- ☆14 · Updated 3 years ago
- PyTorch implementation of distributed deep reinforcement learning ☆74 · Updated 2 years ago
- ☆71 · Updated 5 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆42 · Updated 4 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 · Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆52 · Updated 5 years ago
- Implementation of Soft Actor Critic ☆37 · Updated 3 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆186 · Updated last year
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago
- A3C style Option-Critic with deliberation cost ☆39 · Updated 6 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆104 · Updated 6 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C) ☆44 · Updated 6 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning ☆66 · Updated 4 years ago
- Soft Actor-Critic with advanced features ☆47 · Updated last month