rldotai / mdpyLinks

Markov Decision Processes in Python

☆15

Alternatives and similar repositories for mdpy

Users that are interested in mdpy are comparing it to the libraries listed below

Sorting:

rldotai / rl-algorithms
Reinforcement learning algorithms
☆41Updated 6 years ago
csxeba / trickster
Reinforcement learning in TensorFlow 2
☆22Updated 3 years ago
sergio-hcsoft / FractalAI
Cellular automaton-based calculus for the masses
☆24Updated 7 years ago
cgreer / alpha-zero-boosted
A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)
☆86Updated 4 months ago
pmlg / deep-rl-bootcamp
Deep RL Bootcamp solutions
☆35Updated 7 years ago
cair / fast-tsetlin-machine-in-cuda-with-imdb-demo
A CUDA implementation of the Tsetlin Machine based on bitwise operators
☆26Updated 5 years ago
mtrazzi / gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18Updated 7 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆92Updated 3 years ago
edwardjhu / improved_wasserstein
Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"
☆14Updated 5 years ago
mhlr / awesome-jax
List of awesome JAX resources
☆13Updated 2 years ago
PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆61Updated 4 years ago
learningtopredict / learningtopredict.github.io
☆28Updated 3 years ago
cjratcliff / ml-compiled
Quick definitions and intuitive explanations around machine learning.
☆35Updated last year
mtomassoli / information-theory-tutorial
☆16Updated 8 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
hannw / sgrnn
Tensorflow implementation of Synthetic Gradient for RNN (LSTM)
☆40Updated 7 years ago
hardmaru / gecco-tutorial-2019
2019 talk at GECCO
☆68Updated 6 years ago
ethanluoyc / dhmc-jax
Discontinuous Hamiltonian Monte Carlo in JAX
☆42Updated 5 years ago
deepppl / deepppl
Deep Probabilistic Programming Language
☆19Updated last year
goktug97 / estorch
Evolution Strategy Library
☆55Updated 5 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
ofnote / tsanley
A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")
☆40Updated 5 years ago
d9w / CartesianGeneticProgramming.jl
Cartesian Genetic Programming for Julia
☆69Updated 3 years ago
lansiz / eqpt
The Path to Nash Equilibrium
☆38Updated 2 years ago
rotinov / AGNES
Flexible Reinforcement Learning Framework with PyTorch
☆22Updated 5 years ago
cair / PyTsetlinMachineCUDA
Massively Parallel and Asynchronous Architecture for Logic-based AI
☆42Updated 2 years ago
mtrazzi / spinning-up-a-Pong-AI-with-deep-RL
Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.
☆54Updated 6 years ago
kyunghyuncho / backprop-kalman-filter
☆45Updated 5 years ago
BY571 / Upside-Down-Reinforcement-Learning
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆77Updated 4 years ago
tpbarron / pytorch-a2c
Simple change of a3c to a2c
☆15Updated 8 years ago