orrivlin / Hindsight-Experience-Replay---Bit-FlippingLinks

Simple bit flipping with sparse rewards using HER, similarly to the original paper

☆39

Alternatives and similar repositories for Hindsight-Experience-Replay---Bit-Flipping

Users that are interested in Hindsight-Experience-Replay---Bit-Flipping are comparing it to the libraries listed below

Sorting:

BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆182Updated last year
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆205Updated 2 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 3 weeks ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29Updated 6 years ago
cyoon1729 / RLcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
☆301Updated last year
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆51Updated 4 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆317Updated 2 years ago
archsyscall / DistRL-TensorFlow2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.
☆69Updated 4 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆100Updated 6 years ago
kandouss / marlgrid
Gridworld for MARL experiments
☆141Updated 4 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆319Updated 3 years ago
createamind / DRL
☆92Updated 4 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆103Updated 4 years ago
shakti365 / soft-actor-critic
TF2 Implementation of the Soft Actor-Critic Algorithm
☆43Updated 2 years ago
YuhangSong / Arena-BuildingToolkit
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆84Updated 4 years ago
neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76Updated 3 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆101Updated 3 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆139Updated 2 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259Updated 5 years ago