RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆12Updated 5 years ago
Related projects: ⓘ
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆39Updated last year
- Clockwork VAEs in JAX/Flax☆31Updated 3 years ago
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 3 years ago
- Official implementation of the δ-model presented in the paper "A Distributional Analogue to the Successor Representation".☆12Updated 2 months ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆46Updated 3 years ago
- Baselines for gymnax 🤖☆57Updated last year
- Variational Reinforcement Learning☆16Updated last month
- ☆27Updated 3 years ago
- A Tutorial on Deep Reinforcement Learning in PyTorch☆29Updated last year
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- ☆32Updated last month
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆13Updated 2 months ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆27Updated 3 years ago
- A PyTorch Implementation of Skipper☆20Updated 7 months ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated last year
- ☆27Updated 3 years ago
- ☆15Updated 2 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆21Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20Updated 2 years ago
- A Python Toolkit for Managing a Large Number of Experiments☆30Updated 7 months ago
- ☆52Updated 8 months ago
- ☆28Updated 2 years ago
- Sandbox environment for generalizable agent research☆22Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆13Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Updated 5 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆26Updated 4 years ago