andrewk1 / pytorch-deep-bayesian-banditsLinks
PyTorch port and extension of the Deep Bayesian Bandits Library
☆43Updated 6 years ago
Alternatives and similar repositories for pytorch-deep-bayesian-bandits
Users that are interested in pytorch-deep-bayesian-bandits are comparing it to the libraries listed below
Sorting:
- Code for "Neural causal learning from unknown interventions"☆104Updated 5 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 3 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆80Updated 2 years ago
- Code for "A Meta Transfer Objective For Learning To Disentangle Causal Mechanisms"☆127Updated 7 years ago
- ☆80Updated 2 years ago
- Code for "Recurrent Independent Mechanisms"☆120Updated 3 years ago
- References at the Intersection of Causality and Reinforcement Learning☆90Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 6 years ago
- ☆30Updated 5 years ago
- Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot expe…☆148Updated 2 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆142Updated 6 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆61Updated 5 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses☆194Updated 2 years ago
- On the pitfalls of measuring emergent communication☆34Updated 6 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)☆45Updated 5 years ago
- ☆61Updated 2 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 5 years ago
- Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"☆17Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- ☆79Updated 5 years ago
- ☆89Updated last year
- ☆172Updated last year
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆61Updated 6 years ago
- Pip-installable differentiable stacks in PyTorch!☆65Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- a python implementation of various versions of the information bottleneck, including automated parameter searching☆132Updated 5 years ago
- learning to search in pytorch☆110Updated 5 years ago
- Train a simple convnet on the MNIST dataset and evaluate the BALD acquisition function☆16Updated 8 years ago