addy1997 / Gridworld
OpenAI gym-based algorithm for the grid world problem
☆28Updated 4 years ago
Alternatives and similar repositories for Gridworld
Users that are interested in Gridworld are comparing it to the libraries listed below
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 11 months ago
- Lightweight multi-agent gridworld Gym environment☆208Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 5 years ago
- Implementation of the Hindsight Experience Replay paper with PyTorch☆29Updated 4 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆144Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆128Updated 10 months ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆123Updated 4 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆142Updated 6 years ago
- Solving POMDP using Recurrent networks☆86Updated 5 years ago
- A collection of pre-trained RL agents using Stable Baselines3☆130Updated 7 months ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆56Updated 3 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated last year
- ☆40Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆182Updated 9 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆201Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆50Updated last month
- Gridworld for MARL experiments☆140Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆317Updated 3 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆99Updated 5 years ago
- Collection of OpenAI parametrized action-space environments.☆65Updated 3 months ago
- This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …☆119Updated 7 months ago
- There will be updates later☆85Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆40Updated 6 years ago
- PyTorch implementation of FQF, IQN and QR-DQN.☆178Updated 11 months ago
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment☆106Updated 6 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆46Updated 9 months ago
- ☆198Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆161Updated 2 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆288Updated 4 years ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆77Updated 4 years ago