orrivlin / Hindsight-Experience-Replay---Bit-FlippingLinks
Simple bit flipping with sparse rewards using HER, similarly to the original paper
☆40Updated 6 years ago
Alternatives and similar repositories for Hindsight-Experience-Replay---Bit-Flipping
Users that are interested in Hindsight-Experience-Replay---Bit-Flipping are comparing it to the libraries listed below
Sorting:
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆141Updated 6 years ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆123Updated 4 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆49Updated 3 weeks ago
- Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/☆97Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆125Updated 5 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆100Updated 3 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆29Updated 6 years ago
- ☆41Updated 5 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆27Updated 5 years ago
- Solving POMDP using Recurrent networks☆87Updated 4 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆177Updated 10 months ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 10 months ago
- Implementation of Generatve Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.☆89Updated 6 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 7 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- Soft Actor-Critic with advanced features☆50Updated this week
- Pytorch implementation of distributed deep reinforcement learning☆76Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- Adaptive Attention Span for Reinforcement Learning☆133Updated 5 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆83Updated 4 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆85Updated last year
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Updated 5 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆289Updated 4 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Updated 2 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- Implementation of Soft Actor Critic☆37Updated 3 years ago