rldotai / mdpyLinks
Markov Decision Processes in Python
☆15Updated 6 years ago
Alternatives and similar repositories for mdpy
Users that are interested in mdpy are comparing it to the libraries listed below
Sorting:
- Reinforcement learning algorithms☆41Updated 6 years ago
- Cellular automaton-based calculus for the masses☆24Updated 7 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 6 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 6 months ago
- presentations☆44Updated 6 years ago
- Conditional Associative Logic Memory☆27Updated 7 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- Tensorflow implementation of Synthetic Gradient for RNN (LSTM)☆40Updated 7 years ago
- An introduction to the basic ideas of commutative algebra☆17Updated 5 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆42Updated 5 years ago
- Bayesian Inference and parameter estimation in quant finance.☆43Updated 6 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 5 years ago
- Black box hyperparameter optimization made easy.☆75Updated 2 years ago
- Simple, small, fully-connected Python version of NeoRL☆11Updated 9 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Replication of Uber Neuroevolution paper☆46Updated 7 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Quasi-Newton Algorithm for Stochastic Optimization☆10Updated 3 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")☆40Updated 5 years ago
- Deep RL Bootcamp solutions☆34Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- Examples of building probabilistic models with MXNet linear algebra operators☆23Updated 7 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- A forest that is fast☆41Updated 6 years ago
- A probabilistic programming language, based on Church☆17Updated 7 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Updated 4 years ago
- The Path to Nash Equilibrium☆38Updated 2 years ago
- A neural assembly compiler for pyTorch based on adaptive-neural-compilation☆27Updated 7 years ago