karush17 / Evolution-Strategies-PyTorch
Implementation of OpenAI's Evolution Strategies in PyTorch.
☆20Updated 4 years ago
Related projects: ⓘ
- ☆13Updated this week
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Updated 11 months ago
- ☆11Updated this week
- ☆12Updated this week
- ☆12Updated this week
- ☆12Updated this week
- ☆13Updated this week
- Evolution-based Soft Actor-Critic (ESAC)☆39Updated last month
- ☆14Updated this week
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions☆27Updated 3 years ago
- Implementation of Eligibility Traces with Neural Networks in PyTorch and Tensorflow 2.0☆22Updated 3 years ago
- Prioritized Sequence Experience Replay☆10Updated 3 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆26Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆42Updated last year
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆48Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆27Updated 3 weeks ago
- Combining Evolutionary Algorithms and deep RL in various ways☆98Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆65Updated 4 years ago
- Episodic Control☆19Updated 2 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 3 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning☆25Updated 4 years ago
- PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression☆25Updated 4 years ago
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆28Updated 3 years ago
- General purpose environment wrappers for openai gym☆23Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 3 years ago
- The implement of GAIL with pytorch☆14Updated 4 years ago
- Asymmetric methods for partially observable reinforcement learning☆8Updated 4 months ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 5 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C), Asynchronous Advantage Option-Critic (A2OC), Proximal Policy Optimization (PPO) a…☆8Updated 5 years ago