JohnnyYeeee / math_prog_synth_envLinks
☆12Updated 4 years ago
Alternatives and similar repositories for math_prog_synth_env
Users that are interested in math_prog_synth_env are comparing it to the libraries listed below
Sorting:
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Updated 5 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Updated 3 years ago
- Code for Dataset and Benchmarks Submission, Neurips 2022☆13Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆46Updated 2 years ago
- ☆17Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- ☆22Updated 4 years ago
- ☆57Updated last year
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 4 years ago
- Performant, differentiable reinforcement learning☆23Updated 2 years ago
- ☆30Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago
- Generalised UDRL☆37Updated 3 years ago
- Variational Reinforcement Learning☆17Updated last year
- A modular implementation of PPO, and soon hopefully other algorithms.☆26Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆36Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 3 years ago
- ☆28Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".☆60Updated last year
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 4 years ago
- ☆28Updated 5 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Updated 4 years ago
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆10Updated 5 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated last month
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆11Updated 6 years ago