rldotai / mdpy
Markov Decision Processes in Python
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for mdpy
- Reinforcement learning algorithms☆40Updated 5 years ago
- An attempt to reimplement experiments from the 2013 paper by Wissner-Gross & Freer☆10Updated this week
- Reinforcement Learning Assembly☆92Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆45Updated 5 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 2 years ago
- Convenient hyperparameter optimization☆14Updated 6 months ago
- Neural model of hierarchical reinforcement learning☆16Updated 7 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- ☆16Updated 7 years ago
- PhD Publications and Thesis on LASSO Model Predictive Control☆20Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Cellular automaton-based calculus for the masses☆24Updated 6 years ago
- Tensorflow implementation of Neural Arithmetic Logic Unit, Trask et al.☆29Updated 6 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆40Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 3 years ago
- ☆42Updated 5 years ago
- The Path to Nash Equilibrium☆38Updated last year
- ☆27Updated 2 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆11Updated 3 years ago
- Improving spiking dynamical networks: Accurate delays, higher-order synapses, and time cells☆7Updated 7 years ago
- Understanding RL vision Distill article☆23Updated last year
- Flexible Reinforcement Learning Framework with PyTorch☆22Updated 4 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 5 years ago
- PyTorch implementation of DARLA preprocessing models☆11Updated 6 years ago
- Scripts to generate a dataset with static frames from the Arcade Learning Environment☆18Updated 10 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 4 years ago