RobertTLange / gym-hanoi
A Towers of Hanoi environment in OpenAI Gym Style
☆13Updated 6 years ago
Alternatives and similar repositories for gym-hanoi
Users who are interested in gym-hanoi are comparing it to the libraries listed below.
- A curated list of papers presented in the 📖"Flexible Learning Reading Group" @ TU Berlin. Join us! 🤗☆27Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- Tutorials on learning and using successor representations.☆52Updated 5 years ago
- A Python Toolkit for Managing a Large Number of Experiments☆32Updated last year
- Baselines for gymnax 🤖☆67Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- ☆86Updated 11 months ago
- ☆123Updated last year
- Library to compare and evaluate reward functions☆67Updated last year
- ☆86Updated 4 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 5 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- A Tutorial on Deep Reinforcement Learning in PyTorch☆32Updated 2 years ago
- krazy grid world☆25Updated 5 years ago
- ☆80Updated last year
- ☆31Updated 6 years ago
- Minimizing Control for Credit Assignment with Strong Feedback☆14Updated 8 months ago
- A framework for experimenting with never-ending learning☆79Updated 9 months ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆38Updated 6 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- Code for "Recurrent Independent Mechanisms"☆118Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 6 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆82Updated 3 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆50Updated 2 years ago
- ☆84Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning☆223Updated 2 years ago