yashbonde / freeciv-python
This is the learning environment for Freeciv 3.1 with python bindings for advancements in RL. This is the first project of it's kind in the world and will also be the most challenging environment out there.
☆39Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for freeciv-python
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆76Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆76Updated 5 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- fork of rl-baseline-zoo☆21Updated 4 years ago
- Augmented environments with RL☆102Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- Training (hopefully) safe agents in gridworlds☆25Updated 5 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- World Models applied to the Open AI Sonic Retro Contest☆77Updated 6 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- ☆32Updated 6 years ago
- StarCraft: BroodWars OpenAI Gym environment☆81Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- ☆44Updated 5 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 3 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 6 years ago