JohnnyYeeee / math_prog_synth_envLinks
☆12Updated 3 years ago
Alternatives and similar repositories for math_prog_synth_env
Users that are interested in math_prog_synth_env are comparing it to the libraries listed below
Sorting:
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Updated 5 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 2 years ago
- Generalised UDRL☆37Updated 3 years ago
- ☆28Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆33Updated 2 years ago
- ☆20Updated 5 years ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Reinforcement learning library in JAX.☆100Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- flexible meta-learning in jax☆14Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆19Updated 4 months ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Updated 4 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- ☆36Updated 2 years ago
- ☆16Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 8 months ago
- PAIRED in PyTorch 🔥☆60Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 3 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- Fully differentiable RL environments, written in Ivy.☆65Updated last year
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Updated 4 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year