alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29Updated 8 months ago
Alternatives and similar repositories for Continuous-PPO
Users that are interested in Continuous-PPO are comparing it to the libraries listed below
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆102Updated 8 months ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Updated 4 years ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆56Updated 3 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆122Updated 5 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆53Updated 8 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆50Updated last year
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
- Collection of OpenAI parametrized action-space environments.☆69Updated 10 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Updated 10 months ago
- Lightweight multi-agent gridworld Gym environment☆213Updated 2 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆86Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆202Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆86Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆85Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆46Updated last year
- Datasets with baselines for Offline MARL.☆201Updated 2 months ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated 2 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆59Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 5 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆204Updated last year
- A wrapper that vectorizes OpenAI Gym environments with Ray☆23Updated 2 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆142Updated last year
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆67Updated 2 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Updated last year
- An unofficial implementation for online decision transformer☆41Updated 3 years ago
- A collection of recent MARL papers☆104Updated last year