kachayev / gym-microrts-paper-sb3Links
RL agent to play μRTS with Stable-Baselines3 and PyTorch
☆26Updated 3 years ago
Alternatives and similar repositories for gym-microrts-paper-sb3
Users that are interested in gym-microrts-paper-sb3 are comparing it to the libraries listed below
Sorting:
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Generalised UDRL☆37Updated 3 years ago
- Variational Reinforcement Learning☆16Updated 10 months ago
- ☆17Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- ☆14Updated 5 years ago
- ☆16Updated 4 years ago
- ☆43Updated 4 years ago
- ☆28Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Updated 6 years ago
- Vectorization techniques for fast population-based training.☆56Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Inferring beliefs about dynamics from behavior☆29Updated 7 years ago
- ☆20Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆34Updated 2 years ago
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Updated 5 years ago
- ☆21Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 4 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆59Updated 8 months ago