orrivlin / Hindsight-Experience-Replay---Bit-Flipping
Simple bit flipping with sparse rewards using HER, similarly to the original paper
☆40Updated 6 years ago
Alternatives and similar repositories for Hindsight-Experience-Replay---Bit-Flipping:
Users that are interested in Hindsight-Experience-Replay---Bit-Flipping are comparing it to the libraries listed below
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆175Updated 8 months ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆26Updated 5 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 8 months ago
- Soft Actor-Critic with advanced features☆49Updated last week
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆53Updated 6 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆48Updated 2 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆137Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆102Updated 4 years ago
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆33Updated 5 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆87Updated 2 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- ☆74Updated 10 months ago
- Curiosity-driven Exploration by Self-supervised Prediction☆137Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆125Updated 8 months ago
- ☆25Updated 7 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆105Updated 5 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆54Updated 3 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Updated 2 years ago
- ☆41Updated 5 years ago
- Gridworld for MARL experiments☆139Updated 4 years ago