AlexGrinch / rl_algorithmsLinks
Implementations of different reinforcement learning algorithms
☆10Updated 7 years ago
Alternatives and similar repositories for rl_algorithms
Users that are interested in rl_algorithms are comparing it to the libraries listed below
Sorting:
- ☆17Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆24Updated 2 years ago
- Implementation of Riemannian optimization for skip-gram negative sampling (ACL 2017)☆19Updated 7 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 6 years ago
- Official implementation of the paper "Approximating two value functions instead of one: towards characterizing a new family of Deep Reinf…☆11Updated 4 years ago
- Ranking Policy Gradient☆23Updated 6 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Updated 7 years ago
- gpbo☆25Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 6 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆35Updated 9 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible.☆17Updated 7 years ago
- Auxiliary variable Markov chain Monte Carlo methods☆10Updated 8 years ago
- ☆26Updated 7 years ago
- ☆50Updated last year
- Public repository for the work on bandit problems☆24Updated last year
- ☆42Updated 7 years ago
- ☆30Updated 5 years ago
- PyTorch implementation of Bidirectional Monte Carlo, Annealed Importance Sampling, and Hamiltonian Monte Carlo.☆52Updated 4 years ago
- Here we will to store papers from bayesgroup.ru☆11Updated 9 years ago
- A2C for GVG-AI☆22Updated 7 years ago
- a deep recurrent model for exchangeable data☆34Updated 5 years ago
- ☆31Updated 7 years ago
- The Differentiable Cross-Entropy Method☆124Updated 5 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆40Updated last year
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.☆17Updated 4 years ago
- Python notebooks and slides for CE9010: Introduction to Data Science, Semester 2 2017/18☆52Updated 7 years ago
- ☆36Updated 7 years ago
- A clean TensorFlow implementation of Concrete Dropout☆22Updated 8 years ago