andrewk1 / pytorch-deep-bayesian-bandits
PyTorch port and extension of the Deep Bayesian Bandits Library
☆43Updated 6 years ago
Alternatives and similar repositories for pytorch-deep-bayesian-bandits
Users who are interested in pytorch-deep-bayesian-bandits compare it to the libraries listed below.
- Code for "Neural causal learning from unknown interventions"☆104Updated 5 years ago
- Code for "Systematic Generalization: What Is Required and Can It Be Learned"☆37Updated 6 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 3 years ago
- Code for "A Meta Transfer Objective For Learning To Disentangle Causal Mechanisms"☆127Updated 6 years ago
- Collection of algorithms for approximating Fisher Information Matrix for Natural Gradient (and second order method in general)☆142Updated 6 years ago
- Code for "Recurrent Independent Mechanisms"☆120Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.☆79Updated 2 years ago
- ☆82Updated 5 years ago
- References at the Intersection of Causality and Reinforcement Learning☆90Updated 5 years ago
- Train a simple convnet on the MNIST dataset and evaluate the BALD acquisition function☆16Updated 8 years ago
- Retrieve information from DBLP and update BibTex files automatically☆54Updated 3 years ago
- Automatically Composing Representation Transformations as a Means for Generalization☆24Updated 6 years ago
- Explaining a black-box using Deep Variational Information Bottleneck Approach☆46Updated 3 years ago
- ☆30Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- ☆89Updated last year
- ☆80Updated 2 years ago
- Multiplicative Normalizing Flow (MNF) posteriors for variational Bayesian neural networks☆65Updated 5 years ago
- Measuring compositionality in representation learning☆73Updated 6 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. …☆35Updated 9 years ago
- Semi-synthetic experiments to test several approaches for off-policy evaluation and optimization of slate recommenders.☆43Updated 8 years ago
- GitHub page for the preprint paper "InfoCatVAE: Representation Learning with Categorical Variational Autoencoders"☆14Updated 5 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 5 years ago
- Computing various norms/measures on over-parametrized neural networks☆50Updated 7 years ago
- Stein Variational Policy Gradient for REINFORCE☆18Updated 8 years ago
- A Python implementation of various versions of the information bottleneck, including automated parameter searching☆131Updated 5 years ago
- Ranking Policy Gradient☆23Updated 6 years ago
- TBA☆77Updated 6 years ago
- On the pitfalls of measuring emergent communication☆34Updated 6 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆43Updated 4 years ago