rldotai / mdpy
Markov Decision Processes in Python
☆15 Updated 6 years ago
Alternatives and similar repositories for mdpy:
Users who are interested in mdpy are comparing it to the libraries listed below
- Reinforcement learning algorithms ☆40 Updated 6 years ago
- TensorFlow implementation of the Neural Arithmetic Logic Unit (Trask et al.) ☆29 Updated 6 years ago
- ☆43 Updated 5 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- Reinforcement learning in TensorFlow 2 ☆22 Updated 3 years ago
- ☆27 Updated 3 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 Updated 6 years ago
- TensorFlow implementation of Synthetic Gradient for RNN (LSTM) ☆40 Updated 7 years ago
- Cellular automaton-based calculus for the masses ☆24 Updated 6 years ago
- Simple change of A3C to A2C ☆15 Updated 7 years ago
- A first bare-bones parallel implementation of Go-Explore as described in the Uber Engineering blog post ☆46 Updated 6 years ago
- Simple, small, fully-connected Python version of NeoRL ☆11 Updated 9 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython ☆26 Updated 2 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic… ☆11 Updated 4 years ago
- Deep RL Bootcamp solutions ☆35 Updated 7 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators ☆26 Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 6 years ago
- Replication of Uber Neuroevolution paper ☆46 Updated 6 years ago
- An attempt to reimplement the 2013 paper by Wissner-Gross & Freer ☆10 Updated 3 months ago
- MAP-Elites based on Evolution Strategies ☆31 Updated 3 years ago
- Made for a reading group at the Center for Safe AGI. ☆11 Updated 2 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆31 Updated 5 years ago
- Flexible Reinforcement Learning Framework with PyTorch ☆22 Updated 4 years ago
- Graph Nets in PyTorch ☆27 Updated 2 years ago
- NeurIPS 2019 Paper Implementation ☆12 Updated 2 years ago
- ☆22 Updated 6 years ago
- Convenient hyperparameter optimization ☆14 Updated 10 months ago
- An implementation of "The Kanerva Machine" with PyTorch and Pyro ☆12 Updated 6 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX ☆41 Updated 5 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 Updated 7 years ago