QasimWani / policy-value-methods
Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.
☆23 · Updated 4 years ago
Alternatives and similar repositories for policy-value-methods
Users who are interested in policy-value-methods compare it to the libraries listed below.
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3 ☆99 · Updated 2 years ago
- ☆185 · Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆123 · Updated 4 years ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library ☆223 · Updated 3 months ago
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️ ☆52 · Updated last year
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay. ☆97 · Updated this week
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆288 · Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆55 · Updated 3 years ago
- Lightweight multi-agent gridworld Gym environment ☆208 · Updated last year
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL. ☆144 · Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch ☆100 · Updated 3 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆68 · Updated 3 weeks ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making ☆89 · Updated 2 months ago
- Level-based Foraging (LBF): A multi-agent environment for RL ☆180 · Updated 8 months ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) ☆140 · Updated 6 years ago
- ☆17 · Updated 4 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆178 · Updated 10 months ago
- Baseline implementation of recurrent PPO using truncated BPTT ☆142 · Updated last year
- PyTorch implementation of SAC-Discrete. ☆302 · Updated 9 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch. ☆77 · Updated 4 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆49 · Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆315 · Updated 3 years ago
- Example code for the Gym documentation ☆71 · Updated last year
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow. ☆37 · Updated 2 years ago
- PyTorch implementation of DDPG for continuous control tasks. ☆46 · Updated 5 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆176 · Updated 9 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e… ☆52 · Updated 4 years ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled … ☆118 · Updated 6 months ago
- An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments ☆286 · Updated 2 months ago
- Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning ☆64 · Updated 2 years ago