rldotai / mdpyLinks
Markov Decision Processes in Python
☆15Updated 6 years ago
Alternatives and similar repositories for mdpy
Users that are interested in mdpy are comparing it to the libraries listed below
Sorting:
- Reinforcement learning algorithms☆41Updated 6 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 6 years ago
- Cellular automaton-based calculus for the masses☆24Updated 7 years ago
- Reinforcement learning in TensorFlow 2☆22Updated 3 years ago
- A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)☆86Updated 8 months ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython☆26Updated 3 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61Updated 4 years ago
- A runtime shape checker and auto-annotator for tensor programs (pronounced "stanley")☆40Updated 6 years ago
- Tensorflow implementation of Synthetic Gradient for RNN (LSTM)☆40Updated 7 years ago
- Deep RL Bootcamp solutions☆34Updated 8 years ago
- Massively Parallel and Asynchronous Architecture for Logic-based AI☆43Updated 2 years ago
- Simple, small, fully-connected Python version of NeoRL☆11Updated 9 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Updated 5 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Updated 7 years ago
- Black box hyperparameter optimization made easy.☆76Updated 2 years ago
- Conditional Associative Logic Memory☆27Updated 8 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- Discontinuous Hamiltonian Monte Carlo in JAX☆42Updated 5 years ago
- Flexible Reinforcement Learning Framework with PyTorch☆22Updated 5 years ago
- Alphazero on GPU thanks to CUDA.jl☆32Updated 4 years ago
- Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.☆55Updated 7 years ago
- ☆45Updated 6 years ago
- Sample code for generative recurrent autoencoders.☆26Updated 9 years ago
- Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update☆75Updated last year
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated 2 years ago
- 2019 talk at GECCO☆68Updated 6 years ago
- ☆16Updated 9 years ago
- Cartesian Genetic Programming for Julia☆69Updated 3 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 5 years ago
- Scripts to generate a dataset with static frames from the Arcade Learning Environment☆19Updated 11 years ago