wisnunugroho21 / reinforcement_learning_truly_ppo
Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch
☆20Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_truly_ppo:
Users that are interested in reinforcement_learning_truly_ppo are comparing it to the libraries listed below
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 3 weeks ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆51Updated 4 years ago
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- Official code of Nash-DQN for paper: Nash-DQN algorithm for two-player zero-sum Markov games, details see our paper: A Deep Reinforcement…☆19Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆45Updated 2 years ago
- Solution for Taxi env using HRL (Hierarchical reinforcement learning) (2018)☆21Updated 5 years ago
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆21Updated last year
- A new paper list for multi-agent reinforcement learning (actively maintained)☆25Updated 5 years ago
- Distributional Soft Actor Critic☆52Updated 4 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Updated 3 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆27Updated 3 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- DecentralizedLearning☆24Updated 2 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆26Updated 4 years ago
- ☆15Updated 3 years ago
- ☆27Updated 4 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Updated 2 years ago
- ☆18Updated 2 years ago
- Deep Q-Network (DQN) with Prioritized Experience Replay (PER)☆16Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆30Updated 3 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆30Updated 2 years ago
- World Models with A3C on Carracing-v0 in gym☆33Updated 5 years ago
- Code for demonstration example-task in RUDDER blog☆23Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆103Updated 4 years ago
- Deep RL agents with PyTorch☆35Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆12Updated 11 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago