Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15 Updated 3 years ago
Alternatives and similar repositories for rl-human-prior-tricks:
Users who are interested in rl-human-prior-tricks are comparing it to the repositories listed below
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 Updated 4 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- ☆21 Updated 4 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 Updated last year
- Gym wrapper for pysc2 ☆10 Updated 2 years ago
- ☆15 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- A TF2.0 implementation of RL baselines. ☆10 Updated 3 years ago
- A web based platform for collecting human actions in reinforcement learning environments ☆28 Updated last year
- Variational Reinforcement Learning ☆16 Updated 9 months ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea… ☆36 Updated last year
- ☆16 Updated 4 years ago
- ☆20 Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 11 months ago
- ☆23 Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 Updated 2 years ago
- Implementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) that uses the advantages of TensorFlow 2.x. ☆9 Updated 4 years ago
- Episodic Control ☆19 Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆61 Updated 5 years ago
- Creating fixed-length vectors to describe RL/GA policies ☆20 Updated 3 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation" ☆13 Updated 3 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game ☆21 Updated 8 months ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021. ☆26 Updated 3 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 3 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment… ☆19 Updated 3 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles. ☆26 Updated 5 years ago
- AGAC: Adversarially Guided Actor-Critic ☆48 Updated 3 years ago
- Code for Continual Learning of Control Primitives ☆18 Updated 4 years ago
- ☆26 Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 Updated 3 years ago