mjanschek / pytorch_seed_rl
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14Updated 4 years ago
Alternatives and similar repositories for pytorch_seed_rl
Users that are interested in pytorch_seed_rl are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆31Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- ☆30Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆49Updated 4 years ago
- ICRL 2020☆19Updated 5 years ago
- FEN Code☆37Updated 5 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆52Updated 4 years ago
- A Multi-agent Learning Framework☆62Updated 4 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆31Updated 6 years ago
- Source code for the paper "Divergence-Augmented Policy Optimization"☆37Updated 5 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆31Updated 3 years ago
- Soft Actor-Critic with advanced features☆50Updated last week
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 6 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 2 months ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 2 months ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆23Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Updated 5 years ago
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆34Updated 4 years ago
- Random Network Distillation(RND) algo in Pytorch☆49Updated 6 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆27Updated 4 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- ☆76Updated 11 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Learning Individual Intrinsic Reward in MARL☆62Updated 2 years ago