QasimWani / policy-value-methods
Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.
☆23Updated 4 years ago
Alternatives and similar repositories for policy-value-methods
Users who are interested in policy-value-methods are comparing it to the libraries listed below.
- ☆184Updated 3 years ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆227Updated 4 months ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆123Updated 4 years ago
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆99Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆98Updated 3 weeks ago
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️☆53Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆181Updated 8 months ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆208Updated last year
- Multi-objective Gymnasium environments for reinforcement learning☆326Updated 3 months ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆289Updated 4 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆100Updated 3 years ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆69Updated last week
- Minimal implementation of multi-agent reinforcement learning algorithms☆55Updated 3 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆127Updated 7 months ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆144Updated 3 years ago
- PyTorch implementation of the intrinsic curiosity module (ICM) and A3C algorithm☆22Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆317Updated 3 years ago
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆30Updated 5 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Updated 3 years ago
- Datasets with baselines for offline multi-agent reinforcement learning.☆170Updated 3 weeks ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆177Updated 10 months ago
- Curiosity-driven Exploration by Self-supervised Prediction☆138Updated 2 years ago
- PyTorch implementation of SAC-Discrete.☆302Updated 10 months ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆141Updated 6 years ago
- Gridworld environments for OpenAI gym.☆80Updated last year
- Simple Grid Environment for Gymnasium☆58Updated 3 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆174Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆352Updated 2 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆28Updated 3 weeks ago