orrivlin / Hindsight-Experience-Replay---Bit-Flipping
Simple bit flipping with sparse rewards using HER, similarly to the original paper
☆39 Updated 6 years ago
Alternatives and similar repositories for Hindsight-Experience-Replay---Bit-Flipping
Users that are interested in Hindsight-Experience-Replay---Bit-Flipping are comparing it to the libraries listed below
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆96 Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) ☆142 Updated 6 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆123 Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆51 Updated 2 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆133 Updated last week
- Adaptive Attention Span for Reinforcement Learning ☆133 Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir… ☆126 Updated 5 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow ☆58 Updated 6 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆178 Updated 11 months ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆99 Updated 5 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping ☆300 Updated last year
- TF2 Implementation of the Soft Actor-Critic Algorithm ☆43 Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL. ☆145 Updated 3 years ago
- Gridworld for MARL experiments ☆141 Updated 4 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆138 Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆201 Updated 2 years ago
- Pytorch implementation of distributed deep reinforcement learning ☆76 Updated 3 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆102 Updated 4 years ago
- ☆92 Updated 4 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro ☆175 Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆259 Updated 5 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆114 Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆51 Updated 4 years ago
- 🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2. ☆69 Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆317 Updated 3 years ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆106 Updated 6 years ago
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆60 Updated 6 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆71 Updated 8 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆84 Updated 4 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND) ☆42 Updated 6 years ago