ChuaCheowHuan / reinforcement_learning

My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.

☆37

Alternatives and similar repositories for reinforcement_learning

Users who are interested in reinforcement_learning compare it to the libraries listed below.

DerwenAI / rllib_tutorials
RLlib tutorials
☆66 Updated 3 years ago
archsyscall / DistRL-TensorFlow2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.
☆69 Updated 4 years ago
shakti365 / soft-actor-critic
TF2 Implementation of the Soft Actor-Critic Algorithm
☆43 Updated 2 years ago
Stable-Baselines-Team / rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
☆229 Updated 5 months ago
philtabor / Actor-Critic-Methods-Paper-To-Code
☆184 Updated 3 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q-learning using TensorFlow, openai/gym and openai/retro
☆175 Updated 2 years ago
BY571 / Deep-Reinforcement-Learning-Algorithm-Collection
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
☆77 Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
☆142 Updated 6 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119 Updated 8 months ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29 Updated 6 years ago
fschur / DDQN-with-PyTorch-for-OpenAI-Gym
Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.
☆69 Updated last month
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114 Updated last year
DarylRodrigo / rl_lib
Series of deep reinforcement learning algorithms 🤖
☆29 Updated 4 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆145 Updated 3 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133 Updated last week
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65 Updated 3 months ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆123 Updated 4 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89 Updated 2 years ago
apourchot / CEM-RL
Combining Evolutionary Algorithms and deep RL in various ways
☆102 Updated 4 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317 Updated 3 years ago
apourchot / ERL-pytorch
Combining Evolutionary Algorithms and deep Reinforcement Learning
☆16 Updated 6 years ago
wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e…
☆53 Updated 4 years ago
karush17 / esac
Evolution-based Soft Actor-Critic (ESAC)
☆42 Updated 11 months ago
atavakol / action-branching-agents
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆117 Updated 2 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288 Updated 4 years ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆100 Updated 3 years ago
udion / Transformer-RL
Experiments with transformer based RL algorithms
☆22 Updated 5 years ago
BorealisAI / mtmfrl
Multi Type Mean Field Reinforcement Learning
☆31 Updated 3 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99 Updated 5 years ago
BlueFisher / Advanced-Soft-Actor-Critic
Soft Actor-Critic with advanced features
☆50 Updated last week