alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆26 Updated 3 years ago
Related projects
Alternatives and complementary repositories for Continuous-PPO
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay. ☆89 Updated last year
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,… ☆81 Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL ☆161 Updated 2 months ago
- Minimal implementation of multi-agent reinforcement learning algorithms ☆50 Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch ☆93 Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆43 Updated last year
- MARS (short for Multi-Agent Research Studio) is a library for multi-agent reinforcement learning research. ☆45 Updated 8 months ago
- A collection of pre-trained RL agents using Stable Baselines3 ☆105 Updated 2 weeks ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing ☆132 Updated 3 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆94 Updated 4 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22) ☆21 Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆120 Updated 3 years ago
- Prioritized Experience Replay implementation with proportional prioritization ☆69 Updated last year
- Experiments to train a transformer network to master reinforcement learning environments. ☆33 Updated 3 years ago
- Collection of OpenAI parametrized action-space environments. ☆58 Updated last year
- Datasets with baselines for offline multi-agent reinforcement learning. ☆140 Updated last week
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆66 Updated 3 months ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization ☆77 Updated last year
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆81 Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆107 Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆152 Updated 5 months ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆98 Updated 4 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL. ☆140 Updated 3 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research. ☆51 Updated 5 months ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆67 Updated last year
- Author's PyTorch implementation of the Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆152 Updated last week
- Deep Reinforcement Learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch with some e… ☆50 Updated 3 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017. ☆38 Updated 4 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Multi-Agent Reinforcement Learning with Stable-Baselines3 ☆18 Updated 2 years ago