andrewk1 / pytorch-deep-bayesian-bandits
PyTorch port and extension of the Deep Bayesian Bandits Library
☆42Updated 5 years ago
Alternatives and similar repositories for pytorch-deep-bayesian-bandits:
Users that are interested in pytorch-deep-bayesian-bandits are comparing it to the libraries listed below
- Code for "Neural causal learning from unknown interventions"☆99Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Github page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Updated 4 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Scalable Training of Inference Networks for Gaussian-Process Models, ICML 2019☆41Updated 2 years ago
- Implementation of the paper "Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory", Ron Amit and Ron Meir, ICML 2018☆22Updated 5 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 5 years ago
- Automatically Composing Representation Transformations as a Means for Generalization☆24Updated 5 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆138Updated 5 years ago
- ☆13Updated 5 years ago
- ☆43Updated 5 years ago
- Implementation of the Functional Neural Process models☆43Updated 4 years ago
- ☆68Updated 6 years ago
- ☆85Updated 7 months ago
- Implementation of iterative inference in deep latent variable models☆43Updated 5 years ago
- learning to search in pytorch☆110Updated 5 years ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)☆42Updated 4 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses☆184Updated last year
- A Python implementation of the gradient REBAR estimator.☆46Updated 6 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 7 years ago
- Autoregressive Energy Machines☆77Updated 2 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- a python implementation of various versions of the information bottleneck, including automated parameter searching☆123Updated 4 years ago
- ☆12Updated 7 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 4 years ago
- A tensorflow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence"☆20Updated 6 years ago
- Code for Self-Tuning Networks (ICLR 2019) https://arxiv.org/abs/1903.03088☆53Updated 5 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆188Updated 2 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago