alirezakazemipour / Continuous-PPOLinks

Proximal Policy Optimization (Continuous Version) in PyTorch.

☆29

Alternatives and similar repositories for Continuous-PPO

Users that are interested in Continuous-PPO are comparing it to the libraries listed below

Sorting:

BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89Updated 2 years ago
alirezakazemipour / DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
☆100Updated 2 months ago
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆101Updated 3 years ago
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆83Updated last year
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆37Updated 4 months ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 4 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆146Updated 3 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year
thomashirtz / gym-hybrid
Collection of OpenAI parametrized action-space environments.
☆65Updated 4 months ago
fschur / DDQN-with-PyTorch-for-OpenAI-Gym
Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.
☆69Updated 2 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆102Updated 9 months ago
quantumiracle / MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
☆49Updated last year
chrisyrniu / Recent-Advances-in-Multi-Agent-Reinforcement-Learning
A collection of recent MARL papers
☆94Updated 8 months ago
instadeepai / awesome-marl
A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers
☆53Updated 2 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆188Updated 10 months ago
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87Updated 2 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
rmst / rlrd
PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)
☆41Updated 3 years ago
google-research / reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
☆97Updated 2 years ago
OpenRL-Lab / TiZero
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
☆61Updated last year
Farama-Foundation / momaland
Benchmarks for Multi-Objective Multi-Agent Decision Making
☆96Updated this week
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
instadeepai / og-marl
Datasets with baselines for Offline MARL.
☆176Updated 2 weeks ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 3 weeks ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆32Updated 2 years ago
gkswamy98 / fast_irl
Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.
☆50Updated 2 years ago
toshikwa / soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic(SAC).
☆103Updated 5 years ago
ingambe / RayEnvWrapper
OpenAi's gym environment wrapper to vectorize them with Ray
☆23Updated 2 years ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆82Updated 2 years ago