hermesdt / reinforcement-learning
☆39Updated 4 years ago
Alternatives and similar repositories for reinforcement-learning:
Users that are interested in reinforcement-learning are comparing it to the libraries listed below
- Curiosity-driven Exploration by Self-supervised Prediction☆136Updated last year
- Gridworld for MARL experiments☆138Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆122Updated 6 months ago
- Lightweight multi-agent gridworld Gym environment☆201Updated last year
- Level-based Foraging (LBF): A multi-agent environment for RL☆171Updated 5 months ago
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- A well-documented A2C written in PyTorch☆50Updated 5 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆193Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated 6 months ago
- Pytorch starter code for UC Berkeley's cs285 assignments☆71Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆44Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆141Updated 3 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆135Updated 6 years ago
- ☆191Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆169Updated 6 months ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆161Updated 2 years ago
- Implementation of the Option-Critic Architecture☆38Updated 6 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 2 years ago
- Partially Observable Process Gym☆178Updated 7 months ago
- An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.☆79Updated 3 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆77Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆134Updated 9 months ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆56Updated 4 years ago
- MADDPG in Ray/RLlib☆52Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆238Updated 4 years ago
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆38Updated 4 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆166Updated 8 months ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 5 years ago