QasimWani / policy-value-methods
Deep Reinforcement Learning algorithms for Policy Value methods written from scratch.
☆19Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for policy-value-methods
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆96Updated 2 years ago
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️☆49Updated 6 months ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆208Updated last year
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆120Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆161Updated 2 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆66Updated 3 months ago
- ☆181Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆89Updated last year
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆140Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆26Updated 3 years ago
- Multi-objective Gymnasium environments for reinforcement learning☆289Updated 3 weeks ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆173Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆271Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆296Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆198Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning.☆140Updated last week
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆152Updated 5 months ago
- RLlib tutorials☆64Updated 2 years ago
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆203Updated last year
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 3 months ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆400Updated 2 years ago
- Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.☆50Updated last year
- An RL agent for the Google Football environment☆92Updated 3 years ago
- Gridworld environments for OpenAI gym.☆80Updated 9 months ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆170Updated last year
- Deep recurrent Q learning on CartPole-v1 environment☆74Updated 10 months ago
- SARSA, Q-Learning, Expected SARSA, SARSA(λ) and Double Q-learning Implementation and Analysis☆28Updated 5 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆105Updated 2 weeks ago
- Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG☆63Updated 5 years ago
- PyTorch 1.x Reinforcement Learning Cookbook, published by Packt☆96Updated last year