AlexGrinch / rl_algorithms
Implementations of different reinforcement learning algorithms
☆10 · Updated 6 years ago
Alternatives and similar repositories for rl_algorithms:
Users who are interested in rl_algorithms compare it to the libraries listed below.
- ☆16 · Updated 6 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 · Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 · Updated 2 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Updated 6 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆19 · Updated 7 years ago
- ☆11 · Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 · Updated 6 years ago
- Implementation of Riemannian optimization for skip-gram negative sampling (ACL 2017) ☆19 · Updated 6 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 · Updated 5 years ago
- Comparison of bandit algorithms from the Reinforcement Learning bible. ☆17 · Updated 6 years ago
- TF-Tile: an efficient sparse representation for real-valued data ☆14 · Updated 2 years ago
- Quadrature-based features for kernel approximation ☆16 · Updated 6 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs" ☆16 · Updated 4 years ago
- Code for Fast Information-theoretic Bayesian Optimisation ☆16 · Updated 6 years ago
- NeurIPS 2018: AI for Prosthetics Challenge – 3rd place solution ☆32 · Updated 5 years ago
- Auxiliary variable Markov chain Monte Carlo methods ☆10 · Updated 7 years ago
- Graph Nets in pytorch ☆27 · Updated 2 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN ☆18 · Updated 6 years ago
- Variational Reinforcement Learning ☆16 · Updated 8 months ago
- Here we will store papers from bayesgroup.ru ☆11 · Updated 8 years ago
- A PyTorch implementation of DeepMind's MCTSnet ☆18 · Updated 2 years ago
- Discrete Object Generation with Reversible Inductive Construction (NeurIPS 2019) ☆30 · Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 · Updated 3 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning ☆14 · Updated 2 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 · Updated 5 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions ☆22 · Updated 5 years ago
- ☆50 · Updated 8 months ago
- Reinforcement Learning and Deep Learning Resources ☆16 · Updated 7 years ago
- A2C for GVG-AI ☆21 · Updated 6 years ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019) ☆13 · Updated 5 years ago