AlexGrinch / rl_algorithms
Implementations of different reinforcement learning algorithms
☆10Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for rl_algorithms
- ☆30Updated 4 years ago
- ☆16Updated 5 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Updated 4 years ago
- ☆11Updated 3 years ago
- Implementation of Riemannian optimization for skip-gram negative sampling (ACL 2017)☆19Updated 6 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Updated 5 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 3 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 6 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 2 years ago
- Implementation of the PAC Bayesian GP learning method.☆10Updated 5 years ago
- Implementation of GPLVM and Bayesian GPLVM in pytorch/gpytorch☆15Updated 3 years ago
- Ranking Policy Gradient☆23Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Re-Examining Linear Embeddings for High-dimensional Bayesian Optimization☆41Updated 3 years ago
- Non-stationary Off-policy Evaluation☆13Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated last year
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 4 years ago
- Source for experiments in the Additive Gaussian process paper, as well as extensions relating to dropout.☆21Updated 10 years ago
- Code to minimize the Variational Contrastive Divergence (VCD)☆27Updated 5 years ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆13Updated 4 years ago
- ☆81Updated last year
- Estimators to perform off-policy evaluation☆13Updated 2 months ago
- Here we will to store papers from bayesgroup.ru☆11Updated 7 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.☆14Updated 4 years ago
- The main github repository for NLA2015 course☆12Updated 8 years ago
- Quadrature-based features for kernel approximation☆16Updated 6 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆33Updated 8 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago