wisnunugroho21 / reinforcement_learning_truly_ppo

Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch

☆20

Alternatives and similar repositories for reinforcement_learning_truly_ppo

Users interested in reinforcement_learning_truly_ppo are comparing it to the repositories listed below.


wisnunugroho21 / reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…
☆52 Updated 4 years ago
BY571 / Munchausen-RL
PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆45 Updated 4 years ago
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆50 Updated 4 years ago
BY571 / FQF-and-Extensions
PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…
☆32 Updated 4 years ago
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆51 Updated 4 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40 Updated 2 years ago
robintyh1 / onpolicybaselines
on-policy optimization baselines for deep reinforcement learning
☆30 Updated 5 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114 Updated last year
mike-gimelfarb / deep-successor-features-for-transfer
A reusable framework for successor features for transfer in deep reinforcement learning using keras.
☆44 Updated 4 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆55 Updated 3 months ago
GRAAL-Research / OfflineRLReadingGroup
Offline Reinforcement Learning Reading Group
☆27 Updated 2 years ago
joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆61 Updated 6 years ago
BY571 / Deep-Reinforcement-Learning-Algorithm-Collection
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
☆77 Updated 4 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆90 Updated 2 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92 Updated last week
ReinholdM / Papers-of-Offline-RL
Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18 Updated 3 years ago
thanhnguyentang / mmdrl
Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354
☆26 Updated 3 years ago
vincent-thevenin / DreamerV2-Pytorch
PyTorch implementation of DreamerV2: Mastering Atari with Discrete World Models
☆50 Updated 3 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆45 Updated 3 years ago
Deepest-Project / WorldModels-A3C
World Models with A3C on Carracing-v0 in gym
☆31 Updated 5 years ago
alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29 Updated last month
Howuhh / faster-trajectory-transformer
Implementation of Trajectory Transformer with attention caching and batched beam search
☆111 Updated 2 years ago
google-research / pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆44 Updated 2 years ago
antonai91 / reinforcement_learning
☆15 Updated 4 years ago
BY571 / QR-DQN
PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression
☆28 Updated 4 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆86 Updated 4 years ago
thu-rllab / CFCQL
Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.
☆36 Updated 4 months ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆50 Updated last month
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75 Updated 4 years ago
soumik12345 / DDPG
Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control
☆26 Updated 2 years ago