haarnoja / sacLinks

Soft Actor-Critic

☆1,123

Alternatives and similar repositories for sac

Users that are interested in sac are comparing it to the libraries listed below

Sorting:

rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,328Updated last year
pranz24 / pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
☆898Updated last month
Khrylx / PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…
☆1,239Updated 4 years ago
reinforcement-learning-kr / lets-do-irl
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
☆762Updated last year
quantumiracle / Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…
☆1,272Updated 5 months ago
ghliu / pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
☆616Updated 7 years ago
sfujim / TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
☆1,930Updated 2 years ago
MrSyee / pg-is-all-you-need
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
☆955Updated 2 months ago
sisl / MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
☆713Updated 2 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆556Updated 3 years ago
rail-berkeley / rlkit
Collection of reinforcement learning algorithms
☆2,743Updated last year
rlcode / per
Prioritized Experience Replay (PER) implementation in PyTorch
☆347Updated 5 years ago
shariqiqbal2810 / MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
☆759Updated 3 years ago
MorvanZhou / pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
☆649Updated 2 years ago
xuehy / pytorch-maddpg
A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
☆669Updated 7 years ago
shariqiqbal2810 / maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
☆648Updated 5 years ago
oxwhirl / pymarl
Python Multi-Agent Reinforcement Learning framework
☆2,067Updated 2 years ago
sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆637Updated 4 years ago
vy007vikas / PyTorch-ActorCriticRL
PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.
☆412Updated 4 years ago
MatthewJA / Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
☆1,033Updated 2 years ago
ChenglongChen / pytorch-DRL
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
☆596Updated 7 years ago
katerakelly / oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
☆498Updated 2 years ago
TianhongDai / reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…
☆682Updated 4 years ago
qfettes / DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
☆1,078Updated 4 years ago
araffin / rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
☆1,180Updated 2 years ago
oxwhirl / smac
SMAC: The StarCraft Multi-Agent Challenge
☆1,239Updated last year
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆507Updated 2 years ago
ikostrikov / pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆1,284Updated 5 years ago
dxyang / DQN_pytorch
Vanilla DQN, Double DQN, and Dueling DQN implemented in PyTorch
☆531Updated 7 years ago
yrlu / irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
☆642Updated last year