Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆16 Updated 3 years ago
Related projects:
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea… ☆34 Updated last year
- GPT implementation in Flax ☆18 Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆26 Updated 4 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- ☆16 Updated 3 years ago
- Combining NEAT and novelty search to quickly generate diverse video game levels (GECCO 2022). https://arxiv.org/abs/2204.06934 ☆14 Updated last year
- A web based platform for collecting human actions in reinforcement learning environments ☆26 Updated last year
- ☆28 Updated 2 years ago
- Gym wrapper for pysc2 ☆10 Updated 2 years ago
- Understanding RL vision Distill article ☆23 Updated last year
- A2C is a special case of PPO! ☆19 Updated 2 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆25 Updated 2 years ago
- ☆22 Updated 2 years ago
- Variational Reinforcement Learning ☆16 Updated last month
- Simplistic Pytorch Implementation of the Dreamer-RL ☆20 Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 Updated 2 years ago
- Reward Learning by Simulating the Past ☆43 Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Updated 3 years ago
- Co-evolution of agents and environments in GVG-AI ☆16 Updated 3 years ago
- ☆42 Updated 2 years ago
- ☆11 Updated 3 months ago
- Shared MuJoCo simulation scenes and assets for ROBEL environments. ☆11 Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment ☆27 Updated 2 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation" ☆11 Updated 3 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory ☆14 Updated 3 years ago
- ☆15 Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 3 years ago
- Clockwork VAEs in JAX/Flax ☆31 Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch ☆17 Updated last year