thomashirtz / gym-battleship
Battleship environment for reinforcement learning tasks
☆12Updated last year
Alternatives and similar repositories for gym-battleship:
Users that are interested in gym-battleship are comparing it to the libraries listed below
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆49Updated 2 years ago
- The Starcraft Multi-Agent challenge lite☆41Updated 5 months ago
- ☆70Updated last year
- A2C is a special case of PPO!☆19Updated 2 years ago
- A tool for aggregating and plotting MARL experiment data.☆72Updated last month
- ☆34Updated 2 years ago
- Explainable Reinforcement Learning (XRL) Resources☆37Updated 5 months ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated 10 months ago
- Multi-Agent Reinforcement Learning with Stable-Baselines3☆18Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆54Updated 5 months ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆21Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆74Updated last year
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"☆28Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 6 months ago
- Minimal code for A Generalist Agent☆39Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆69Updated 6 months ago
- JAX and PZ RL envs + algorithms for swarms of CrazyFlies☆72Updated 6 months ago
- Collection of RL Environments built using Madrona☆28Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆93Updated 4 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆35Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆92Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆108Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆76Updated last year
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆24Updated 2 years ago
- Reinforcement learning framework.☆13Updated 3 months ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆18Updated last year
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆35Updated last year
- Source files to replicate experiments in my ICLR 2022 paper.☆69Updated 8 months ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆174Updated 5 months ago
- ☆40Updated 3 years ago