AlexGrinch / rl_algorithmsLinks

Implementations of different reinforcement learning algorithms

☆10

Alternatives and similar repositories for rl_algorithms

Users that are interested in rl_algorithms are comparing it to the libraries listed below

Sorting:

ChangYong-Oh / HyperSphere
☆16Updated 6 years ago
automl / LTO-CMA
Code for the paper "Learning Step-Size Adaptation in CMA-ES"
☆11Updated 2 years ago
juliagusak / neural-ode-norm
Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"
☆16Updated 5 years ago
stat-ml / dpp-dropout-uncertainty
Effective uncertainty estimation with decorellation and DPP mask for dropout
☆9Updated 2 years ago
vr308 / GPLVM
Implementation of GPLVM and Bayesian GPLVM in pytorch/gpytorch
☆15Updated 4 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
juliagusak / neural-ode-metasolver
Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561
☆25Updated 4 years ago
criteo-research / tf-tile
TF-Tile: an efficient sparse representation for real-valued data
☆14Updated 2 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15Updated 6 years ago
Daulbaev / IRDM
☆11Updated 4 years ago
Akella17 / Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
☆16Updated 4 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆50Updated 6 years ago
boschresearch / PAC_GP
Implementation of the PAC Bayesian GP learning method.
☆10Updated 6 years ago
duvenaud / additive-gps
Source for experiments in the Additive Gaussian process paper, as well as extensions relating to dropout.
☆22Updated 11 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16Updated 6 years ago
markm541374 / gpbo
gpbo
☆25Updated 4 years ago
facebookresearch / alebo
Re-Examining Linear Embeddings for High-dimensional Bayesian Optimization
☆41Updated 3 years ago
RobRomijnders / bandit
Implementation of Counterfactual risk minimization
☆26Updated 8 years ago
criteo-research / optimization-continuous-action-crm
☆30Updated 5 years ago
rubinxin / FITBO
Code for Fast Information-theoretic Bayesian Optimisation
☆16Updated 7 years ago
izmailovpavel / TTGP
☆26Updated 7 years ago
sungyubkim / amortized_svgd
A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN
☆19Updated 6 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated 11 months ago
maremun / quffka
Quadrature-based features for kernel approximation
☆16Updated 6 years ago
IlyaTrofimov / bb2vec
This is the source code of the paper "Inferring Complementary Products from Baskets and Browsing Sessions"
☆11Updated 6 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Updated 5 years ago
rballester / ttrecipes
Metamodeling, sensitivity analysis and visualization using the tensor train format
☆21Updated 2 years ago
franrruiz / augment-reduce
Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions
☆10Updated 7 years ago
matt-graham / phd-thesis
Auxiliary variable Markov chain Monte Carlo methods
☆10Updated 7 years ago
Riashat / Bayesian-Exploration-Deep-RL
Bayesian Uncertainty Exploration in Deep Reinforcement Learning
☆18Updated 7 years ago