alirezakazemipour / Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29Updated 2 months ago
Alternatives and similar repositories for Continuous-PPO
Users who are interested in Continuous-PPO are comparing it to the repositories listed below.
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆89Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆100Updated 2 months ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆101Updated 3 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆83Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆37Updated 4 months ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆146Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆183Updated last year
- Collection of OpenAI parametrized action-space environments.☆65Updated 4 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆69Updated 2 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆102Updated 9 months ago
- MARS is short for Multi-Agent Research Studio, a library for multi-agent reinforcement learning research.☆49Updated last year
- A collection of recent MARL papers☆94Updated 8 months ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆53Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆188Updated 10 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆51Updated 2 months ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆41Updated 3 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆97Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023), a football game agent☆61Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆96Updated this week
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago
- Datasets with baselines for Offline MARL.☆176Updated 2 weeks ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated 3 weeks ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Updated 2 years ago
- Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.☆50Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC).☆103Updated 5 years ago
- Wrapper for OpenAI Gym environments that vectorizes them with Ray☆23Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆82Updated 2 years ago