mitchellgoffpc / flatland-training
Experiments with flatland.aicrowd.com
☆8Updated last year
Alternatives and similar repositories for flatland-training:
Users that are interested in flatland-training are comparing it to the libraries listed below
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated 2 years ago
- ☆21Updated 4 years ago
- The Path to Nash Equilibrium☆38Updated 2 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆26Updated 3 years ago
- ☆28Updated 2 years ago
- Collection of in-progress libraries for entity neural networks.☆29Updated 2 years ago
- ☆16Updated 3 years ago
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- ☆18Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Reward Learning by Simulating the Past☆44Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆34Updated 5 years ago
- Safe Reinforcement Learning algorithms☆74Updated 2 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆38Updated 3 years ago
- Minimal A2C/A3C example of an LSTM-based meta-learner.☆13Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Pytorch implementation of the Deep Deterministic Policy Gradients for Continuous Control☆26Updated 2 years ago
- ☆20Updated 5 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆84Updated 3 years ago
- Experiments in applying interpretability techniques to learned reward functions.☆9Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆24Updated 3 years ago