AlexGrinch / rl_algorithms
Implementations of different reinforcement learning algorithms
☆10Updated 7 years ago
Alternatives and similar repositories for rl_algorithms
Users who are interested in rl_algorithms are comparing it to the libraries listed below
- ☆17Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆24Updated 2 years ago
- Here we will store papers from bayesgroup.ru☆11Updated 8 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- Auxiliary variable Markov chain Monte Carlo methods☆10Updated 7 years ago
- ☆82Updated 2 years ago
- Scalable GP Adapter for Time Series Classification☆13Updated 8 years ago
- Scalable Log Determinants for Gaussian Process Kernel Learning (https://arxiv.org/abs/1711.03481) (NIPS 2017)☆18Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Updated 2 years ago
- Public repository for the work on bandit problems☆23Updated last year
- ☆30Updated 5 years ago
- gpbo☆25Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- A clean TensorFlow implementation of Concrete Dropout☆22Updated 7 years ago
- ☆23Updated 5 years ago
- PyTorch implementation of Bidirectional Monte Carlo, Annealed Importance Sampling, and Hamiltonian Monte Carlo.☆52Updated 4 years ago
- ☆25Updated 7 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆23Updated 5 years ago
- Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"☆39Updated 7 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 5 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆34Updated 9 years ago
- Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks☆65Updated 5 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 6 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Updated 4 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆18Updated 7 years ago
- ☆50Updated last year
- a deep recurrent model for exchangeable data☆34Updated 5 years ago
- Python notebooks and slides for CE9010: Introduction to Data Science, Semester 2 2017/18☆52Updated 7 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- Stochastic Gradient Riemannian Langevin Dynamics☆34Updated 10 years ago