StoneT2000 / Halite-4-Tournament-Runner
Runs a local halite 4 tournament with your agents ranked by trueskill/elo
☆18Updated last year
Related projects: ⓘ
- Kaggle Halite RL challenge - Provisional first place solution☆60Updated 3 years ago
- This is the code for Halite IV competition: https://www.kaggle.com/c/halite/overview☆9Updated 4 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 2 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆54Updated 5 years ago
- A reinforcement learning framework☆154Updated 5 years ago
- A simple moving dot environment for OpenAI Gym to test reinforcement learning algorithms☆21Updated 2 years ago
- 2019 talk at GECCO☆68Updated 5 years ago
- RL experiments☆69Updated last year
- HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own…☆282Updated 4 months ago
- This project was moved to: https://github.com/coax-dev/coax☆160Updated last year
- Implementation of TD-Gammon in TensorFlow.☆110Updated 5 years ago
- Publicly releasable baselines for the Retro contest☆128Updated 5 years ago
- Celular automaton-based calculus for the masses☆110Updated 4 years ago
- RLtime is a reinforcement learning library focused on state-of-the-art q-learning algorithms and features☆138Updated 4 years ago
- ☆196Updated last month
- Full World Models Implementation in Chainer☆165Updated 6 years ago
- This package allows to use PLE as a gym environment.☆73Updated 4 years ago
- safemutations☆143Updated 6 years ago
- Run evolution strategies on Google Kubernetes Engine☆31Updated last year
- Guided Evolutionary Strategies☆264Updated last year
- A simple stochastic OpenAI environment for training RL agents☆89Updated last year
- Collection of tutorials, exercises and papers on RL☆17Updated 6 years ago
- A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"☆152Updated 5 years ago
- C51-DDQN in Keras☆125Updated 6 years ago
- some common TD Learning algorithms☆67Updated 4 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆188Updated last year
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆373Updated last year
- Replication of Uber Neuroevolution paper☆46Updated 6 years ago
- An implementation of the ideas from this paper https://arxiv.org/pdf/1803.10122.pdf☆280Updated last year
- Codes of our team for the OpenAI Retro Contest of reinforcement learning☆100Updated 6 years ago