Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for rl-human-prior-tricks
- GPT implementation in Flax☆18Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Clockwork VAEs in JAX/Flax☆32Updated 3 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆35Updated last year
- Generalised UDRL☆37Updated 2 years ago
- A2C is a special case of PPO!☆19Updated 2 years ago
- Combining NEAT and novelty search to quickly generate diverse video game levels (GECCO 2022). https://arxiv.org/abs/2204.06934☆15Updated 2 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 3 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆27Updated last year
- ☆21Updated 4 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆21Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- ☆28Updated 2 years ago
- ☆16Updated 3 years ago
- Write simple games in Numpy!☆12Updated 2 years ago
- ☆15Updated 2 years ago
- ☆20Updated 5 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- PyTorch implementation of DARLA preprocessing models☆11Updated 6 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 3 years ago
- The Official Implementation of Domain Adaptive Imitation Learning (DAIL)☆22Updated 4 years ago
- ☆17Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- A minimal implementation of Go-Explore without domain knowledge☆13Updated 3 years ago
- ☆22Updated 3 years ago