JohnnyYeeee / math_prog_synth_env
☆12Updated 3 years ago
Alternatives and similar repositories for math_prog_synth_env
Users that are interested in math_prog_synth_env are comparing it to the libraries listed below
Sorting:
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Updated 5 years ago
- Variational Reinforcement Learning☆16Updated 9 months ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆23Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- ☆28Updated 2 years ago
- ☆20Updated 5 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- A minimal implementation of Go-Explore without domain knowledge☆15Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆19Updated 3 months ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Generalised UDRL☆37Updated 3 years ago
- Fully differentiable RL environments, written in Ivy.☆65Updated last year
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)☆18Updated 4 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 3 years ago
- krazy grid world☆25Updated 5 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- ☆27Updated 4 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 3 years ago
- ☆29Updated 4 years ago
- GPT implementation in Flax☆18Updated 3 years ago