ArthurFirmino / gym-battlesnakeLinks
Multi-agent reinforcement learning environment
☆37Updated 6 years ago
Alternatives and similar repositories for gym-battlesnake
Users that are interested in gym-battlesnake are comparing it to the libraries listed below
Sorting:
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Updated 6 years ago
- Clone of OpenAI's Spinning Up in PyTorch☆152Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆75Updated 3 years ago
- An OpenAI gym environment made for RL☆70Updated last year
- impact-driven-exploration☆132Updated last year
- OpenAI Gym wrapper for ViZDoom enviroments☆69Updated 4 years ago
- Hindsight Experience Replay - Bit flipping experiment in Tensorflow☆58Updated 6 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆140Updated 2 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆205Updated 4 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 2 years ago
- Gridworld for MARL experiments☆141Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆175Updated 3 years ago
- The submission template for the MineRL Competition @ NeurIPS 2021. Clone this to make a new submission!☆92Updated 3 years ago
- Gridworld environments for OpenAI gym.☆79Updated last year
- Revisiting Rainbow☆75Updated 4 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions☆149Updated 4 years ago
- megastep helps you build 1-million FPS reinforcement learning environments on a single GPU☆140Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- Benchmarking TD3 and DDPG on PyBullet☆54Updated 6 years ago
- RUDDER: Return Decomposition for Delayed Rewards☆48Updated 4 years ago
- ☆91Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- A Tensorflow implementation of the Option-Critic Architecture☆71Updated 8 years ago
- ☆40Updated last week
- A simple example of how to implement vector based DQN using PyTorch and a ML-Agents environment☆93Updated 6 years ago
- A structured implementation of MuZero☆205Updated 3 years ago
- Performances of Reinforcement Learning Agents☆53Updated 5 years ago
- Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.☆101Updated 5 years ago
- Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…☆126Updated 5 years ago