Miffyli / rl-human-prior-tricksLinks
Evaluating different engineering tricks that make RL work
☆15Updated 4 years ago
Alternatives and similar repositories for rl-human-prior-tricks
Users that are interested in rl-human-prior-tricks are comparing it to the libraries listed below
Sorting:
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 4 years ago
- Generalised UDRL☆37Updated 3 years ago
- Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"☆25Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 11 months ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆10Updated 3 years ago
- ☆20Updated 4 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- ☆10Updated 4 years ago
- Understanding RL vision Distill article☆23Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆16Updated 4 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 6 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆35Updated 2 years ago
- ☆31Updated 2 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.☆26Updated 4 years ago
- ☆30Updated 3 years ago
- ☆23Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 4 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated last month
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 4 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Updated 10 months ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 4 years ago
- flexible meta-learning in jax☆14Updated last year
- A TF2.0 implementation of RL baselines.☆10Updated 3 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 3 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation"☆13Updated 4 years ago