AlexGrinch / rl_algorithms
Implementations of different reinforcement learning algorithms
☆10Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for rl_algorithms
- ☆11Updated 3 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 3 years ago
- ☆30Updated 4 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Updated 4 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- Here we will to store papers from bayesgroup.ru☆11Updated 7 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated last year
- This repository contains an implementation of the Batch-BKB algorithm as described in the ICML 2020 paper "Near-linear time Gaussian proc…☆13Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Effective uncertainty estimation with decorellation and DPP mask for dropout☆9Updated last year
- Quadrature-based features for kernel approximation☆16Updated 6 years ago
- Exercises for the Tutorial on Approximate Bayesian Inference at the Data Science Summer School 2018☆22Updated 6 years ago
- ☆16Updated 5 years ago
- Implementation of Counterfactual risk minimization☆26Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning☆14Updated last year
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆19Updated 6 years ago
- Code for "Exponential Family Estimation via Adversarial Dynamics Embedding" (NeurIPS 2019)☆13Updated 4 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆36Updated 3 years ago
- ☆47Updated 3 months ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 6 years ago
- PyTorch implementation of Bidirectional Monte Carlo, Annealed Importance Sampling, and Hamiltonian Monte Carlo.☆52Updated 3 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 2 years ago
- This is the source code of the paper "Inferring Complementary Products from Baskets and Browsing Sessions"☆11Updated 5 years ago
- Implementation of the PAC Bayesian GP learning method.☆10Updated 5 years ago
- Auxiliary variable Markov chain Monte Carlo methods☆10Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Quasi-Newton Algorithm for Stochastic Optimization☆10Updated 2 years ago
- Ranking Policy Gradient☆23Updated 4 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 3 years ago