kvsnoufal / reinforce

Implementing REINFORCE algorithm on Pong, Lunar Lander and Cartplot + Medium Article

☆22

Alternatives and similar repositories for reinforce:

Users that are interested in reinforce are comparing it to the libraries listed below

mdeib / berkeley-deep-RL-pytorch-starter
Pytorch starter code for UC Berkeley's cs285 assignments
☆71Updated 3 years ago
ElisevanderPol / symmetrizer
☆31Updated 4 years ago
kushagra06 / SAC
Pytorch implementation of Soft Actor-Critic
☆18Updated 5 years ago
MishaLaskin / torchingup
TorchingUp provides minimal implementations of common Reinforcement Learning algorithms written in PyTorch. It is meant to complement Ope…
☆47Updated 2 years ago
Ankur-Deka / Emergent-Multiagent-Strategies
Emergence of complex strategies through multiagent competition
☆44Updated 2 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆83Updated 3 years ago
kzl / lifelong_rl
Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…
☆104Updated 3 years ago
jcwleo / curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
☆137Updated 2 years ago
social-dilemma / multiagent
Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas
☆49Updated 2 years ago
Pervasive-AI-Lab / crlmaze
Continual Reinforcement Learning in 3D Non-stationary Environments
☆37Updated 5 years ago
joonaspu / video-game-behavioural-cloning
Behavioural cloning experiments with video games
☆33Updated 5 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆74Updated 3 years ago
moratodpg / imp_marl
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
☆41Updated 7 months ago
BY571 / Deep-Reinforcement-Learning-Algorithm-Collection
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
☆79Updated 4 years ago
hermesdt / reinforcement-learning
☆39Updated 4 years ago
younggyoseo / RE3
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆68Updated 3 years ago
ml-jku / OfflineRL
☆31Updated 2 years ago
wendelinboehmer / dcg
☆75Updated 10 months ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 4 years ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆204Updated last year
yifan12wu / rl-laplacian
Learning Laplacian Representations in Reinforcement Learning
☆17Updated 4 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
schroederdewitt / mackrl
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33Updated 5 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆49Updated 2 years ago
acyclics / MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆27Updated 4 years ago
wisnunugroho21 / asynchronous_impala_PPO
Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation
☆36Updated 4 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48Updated 3 years ago
hamishs / JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
☆21Updated 2 months ago
Bellman-devs / bellman
Model-based reinforcement learning in TensorFlow
☆55Updated 3 years ago
flowersteam / TeachMyAgent
TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.
☆71Updated last year