AlexGrinch / rl_algorithms
Implementations of different reinforcement learning algorithms
☆10 Updated 6 years ago
Alternatives and similar repositories for rl_algorithms
Users who are interested in rl_algorithms compare it to the repositories listed below.
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 Updated 2 years ago
- ☆16 Updated 6 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561 ☆25 Updated 4 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 Updated 6 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs" ☆16 Updated 5 years ago
- ☆30 Updated 4 years ago
- Implementation of Riemannian optimization for skip-gram negative sampling (ACL 2017) ☆19 Updated 7 years ago
- ☆15 Updated 2 years ago
- ☆11 Updated 4 years ago
- Implementation of the PAC Bayesian GP learning method. ☆10 Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 Updated 3 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆19 Updated 7 years ago
- Variational Reinforcement Learning ☆16 Updated 10 months ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments" ☆17 Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 Updated 5 years ago
- Auxiliary variable Markov chain Monte Carlo methods ☆10 Updated 7 years ago
- ☆12 Updated 4 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 Updated 6 years ago
- Code to minimize the Variational Contrastive Divergence (VCD) ☆29 Updated 6 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions ☆10 Updated 7 years ago
- My homework solutions for UC Berkeley CS294: deep unsupervised learning ☆14 Updated 2 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution ☆38 Updated 4 years ago
- Scalable GP Adapter for Time Series Classification ☆13 Updated 7 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances ☆12 Updated 2 years ago
- Bachelor thesis ☆11 Updated 9 years ago
- ☆83 Updated 2 years ago
- ☆26 Updated 6 years ago
- Code for Deep Reinforcement and InfoMax Learning (NeurIPS 2020) ☆10 Updated 4 years ago
- ☆13 Updated 2 years ago
- Implementation of Counterfactual risk minimization ☆26 Updated 8 years ago