rldotai / mdpyLinks
Markov Decision Processes in Python
☆15Updated 6 years ago
Alternatives and similar repositories for mdpy
Users that are interested in mdpy are comparing it to the libraries listed below
Sorting:
- Reinforcement learning algorithms☆41Updated 6 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 7 months ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 6 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- Cellular automaton-based calculus for the masses☆24Updated 7 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆42Updated 5 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython☆26Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- Quick definitions and intuitive explanations around machine learning.☆37Updated 2 months ago
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update☆75Updated 11 months ago
- Modeling agents with probabilistic programs☆67Updated 6 years ago
- ☆29Updated 3 years ago
- List of awesome JAX resources☆13Updated 2 years ago
- OpenAI's cartpole env solver.☆18Updated 6 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- ☆48Updated 5 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆190Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- Examples of building probabilistic models with MXNet linear algebra operators☆23Updated 8 years ago
- Replication of Uber Neuroevolution paper☆46Updated 7 years ago
- Conditional Associative Logic Memory☆27Updated 7 years ago
- Parameter Importance Analysis Tool☆77Updated 4 years ago
- Evolution Strategy Library☆55Updated 5 years ago
- Probabilistic Programming eXecution protocol (PPX)☆75Updated 3 years ago
- Blazingly fast capsule networks in 75 lines of pytorch+einops☆26Updated 4 years ago