Miffyli / minecraft-bc
Submission code of the UEFDRL team for the NeurIPS 2019 MineRL challenge (5th place)
☆12 Updated 4 years ago
Related projects
Alternatives and complementary repositories for minecraft-bc
- Behavioural cloning experiments with video games ☆30 Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 Updated 3 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆52 Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- ☆41 Updated 3 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 Updated last year
- ☆29 Updated 3 years ago
- ☆37 Updated 2 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills" ☆36 Updated 4 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019) ☆61 Updated 4 years ago
- ☆36 Updated last year
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆35 Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆23 Updated 5 years ago
- Reinforcement Learning with Latent Flow ☆43 Updated 3 years ago
- Change-Based Exploration Transfer ☆36 Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆66 Updated 5 years ago
- Hierarchical Self-Play ☆21 Updated 5 years ago
- Disagreement-Regularized Imitation Learning ☆30 Updated 3 years ago
- Revisiting Rainbow ☆73 Updated 3 years ago
- ☆29 Updated 5 years ago
- ☆28 Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC) ☆102 Updated last year
- Episodic Control ☆19 Updated 2 years ago
- A minimal implementation of Go-Explore without domain knowledge ☆13 Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆50 Updated 3 years ago
- ☆14 Updated 3 years ago
- Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS ☆50 Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago