thomashirtz / gym-battleshipLinks
Battleship environment for reinforcement learning tasks
☆14Updated 2 years ago
Alternatives and similar repositories for gym-battleship
Users that are interested in gym-battleship are comparing it to the libraries listed below
Sorting:
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆102Updated 7 months ago
- Multi-Agent Reinforcement Learning with Stable-Baselines3☆20Updated 4 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆120Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆98Updated 2 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆157Updated 2 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆43Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆200Updated last year
- ☆73Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆75Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆67Updated 5 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29Updated 7 months ago
- The Starcraft Multi-Agent challenge lite☆43Updated last year
- PyTorch implementation of GAIL and PPO reinforcement learning algorithms☆25Updated 4 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆31Updated 4 years ago
- ☆246Updated last year
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Updated 4 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- A2C is a special case of PPO!☆22Updated 3 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Updated 4 years ago
- Lightweight multi-agent gridworld Gym environment☆212Updated 2 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆85Updated 2 years ago
- ☆23Updated last year
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Updated 4 years ago
- Collection of RL Environments built using Madrona☆37Updated 2 years ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆46Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆363Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Updated 2 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 5 years ago