VowpalWabbit / reinforcement_learning

Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update

☆75

Alternatives and similar repositories for reinforcement_learning

Users who are interested in reinforcement_learning are comparing it to the libraries listed below.

microsoft / statopt
Statistical adaptive stochastic optimization methods
☆32 Updated 5 years ago
pmlg / deep-rl-bootcamp
Deep RL Bootcamp solutions
☆35 Updated 7 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.
☆79 Updated last year
opium-sh / prl
Open-source library for reinforcement learning research.
☆54 Updated 2 years ago
RobertTLange / reading-notes-ml
Progress, Notes, Summaries and a lot of Questions on Machine Learning
☆55 Updated 5 years ago
PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆61 Updated 4 years ago
google / deluca
Performant, differentiable reinforcement learning
☆122 Updated this week
microsoft / coax
This project was moved to: https://github.com/coax-dev/coax
☆160 Updated 2 years ago
hardmaru / mdn_jax_tutorial
Mixture Density Networks (Bishop, 1994) tutorial in JAX
☆60 Updated 5 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51 Updated 6 years ago
mcleonard / xaby
Functional machine learning for fun
☆85 Updated 4 years ago
VowpalWabbit / coba
Contextual bandit benchmarking
☆50 Updated last month
hardmaru / gecco-tutorial-2019
2019 talk at GECCO
☆68 Updated 6 years ago
Cohere-Labs-Community / rl
Generic reinforcement learning codebase in TensorFlow
☆95 Updated 3 years ago
pronobis / libspn-keras
Library for learning and inference with Sum-product Networks utilizing TensorFlow 2.x and Keras
☆48 Updated 4 years ago
udellgroup / oboe
An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.
☆83 Updated 3 years ago
facebookresearch / dagger
Experiment orchestration
☆103 Updated 5 years ago
kyunghyuncho / backprop-kalman-filter
☆45 Updated 5 years ago
isl-org / LMRS
Source code for ICLR 2020 paper: "Learning to Guide Random Search"
☆40 Updated 11 months ago
mfranzs / meta-learning-curiosity-algorithms
☆80 Updated last year
gomerudo / nas-env
A simple OpenAI Gym environment for Neural Architecture Search (NAS)
☆30 Updated 5 years ago
facebookresearch / rela
Reinforcement Learning Assembly
☆92 Updated 3 years ago
Feryal / rl_mlss_2020
☆53 Updated 5 years ago
ericjang / maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in Jax
☆191 Updated 2 years ago
iurteaga / bandits
Public repository for the work on bandit problems
☆23 Updated last year
0xangelo / raylab
Reinforcement learning algorithms in RLlib
☆59 Updated last year
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 Updated 6 years ago
BY571 / Upside-Down-Reinforcement-Learning
Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.
☆77 Updated 4 years ago
yngtdd / hyperspace
Distributed Bayesian Optimization
☆23 Updated 5 years ago
goktug97 / estorch
Evolution Strategy Library
☆55 Updated 5 years ago