rldotai / mdpyLinks
Markov Decision Processes in Python
☆15Updated 6 years ago
Alternatives and similar repositories for mdpy
Users that are interested in mdpy are comparing it to the libraries listed below
Sorting:
- Reinforcement learning algorithms☆41Updated 6 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 6 years ago
- Cellular automaton-based calculus for the masses☆24Updated 7 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 8 months ago
- ☆29Updated 3 years ago
- Conditional Associative Logic Memory☆27Updated 8 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆42Updated 5 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- An interface with micropolis for city-building agents, packaged as an OpenAI gym environment☆158Updated 7 months ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Evolution Strategy Library☆55Updated 5 years ago
- Deep RL Bootcamp solutions☆34Updated 7 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")☆40Updated 6 years ago
- presentations☆44Updated 6 years ago
- Quick definitions and intuitive explanations around machine learning.☆37Updated 2 months ago
- Tensorflow implementation of Synthetic Gradient for RNN (LSTM)☆40Updated 7 years ago
- Bayesian Inference and parameter estimation in quant finance.☆43Updated 6 years ago
- Replication of Uber Neuroevolution paper☆46Updated 7 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- 2019 talk at GECCO☆68Updated 6 years ago
- Black box hyperparameter optimization made easy.☆76Updated 2 years ago
- A forest that is fast☆42Updated 6 years ago
- My solution for the competition "Le meilleur data scientist de France 2018" (Best Data Scientist of France 2018)☆12Updated 7 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago