Guillaume-Cr / lunar_lander_perLinks
☆17Updated 5 years ago
Alternatives and similar repositories for lunar_lander_per
Users that are interested in lunar_lander_per are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆245Updated 11 months ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 5 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Updated 4 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- POMDP wrappers for OpenAI Gym☆15Updated 6 years ago
- ☆187Updated 3 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆294Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆158Updated last year
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29Updated 7 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆79Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆102Updated 7 months ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆44Updated 3 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆142Updated last year
- Collection of OpenAI parametrized action-space environments.☆67Updated 9 months ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Updated 2 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆148Updated 6 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆200Updated last year
- Lightweight multi-agent gridworld Gym environment☆213Updated 2 years ago
- OpenAI Gym environment designed for training RL agents to control the flight of a two-dimensional drone.☆56Updated 2 months ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆30Updated 7 years ago
- Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3☆102Updated 3 years ago
- ppo-lstm-parallel☆48Updated 6 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Updated last year
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆200Updated last year
- Emergence of complex strategies through multiagent competition☆45Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆54Updated 2 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆70Updated 7 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆137Updated 4 months ago