zhougroup / BAMLinks
Bayesian Attention Modules
☆36Updated 5 years ago
Alternatives and similar repositories for BAM
Users that are interested in BAM are comparing it to the libraries listed below
Sorting:
- Official implementation of Transformer Neural Processes☆78Updated 3 years ago
- An Empirical Study of Invariant Risk Minimization☆27Updated 5 years ago
- ☆38Updated 5 years ago
- Codes for Causal Semantic Generative model (CSG), the model proposed in "Learning Causal Semantic Representation for Out-of-Distribution …☆77Updated 3 years ago
- ☆111Updated 3 years ago
- Robust Learning with the Hilbert-Schmidt Independence Criterion☆49Updated 5 years ago
- ☆16Updated 2 years ago
- Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style☆54Updated 4 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 3 years ago
- Experiments to reproduce results in Interventional Causal Representation Learning.☆28Updated 2 years ago
- Code to reproduce the results for Compositional Attention☆59Updated 3 years ago
- This is the code for the paper Embrace the Gap: VAEs perform Independent Mechanism Analysis, showing that optimizing the ELBO is equivale…☆23Updated last year
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 4 years ago
- This is reimplementation of "Simple and Principled Uncertainty Estimation with Deterministic Deep Learning via Distance Awareness" in Pyt…☆52Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 4 years ago
- Noise Contrastive Estimation (NCE) in PyTorch☆32Updated 10 months ago
- ☆21Updated 5 years ago
- ☆20Updated 4 years ago
- ☆45Updated 3 years ago
- Code accompanying paper: Meta-Learning to Improve Pre-Training☆37Updated 4 years ago
- How certain is your transformer?☆25Updated 4 years ago
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Updated 4 years ago
- Posterior Network: Uncertainty Estimation without OOD Samples via Density-Based Pseudo-Counts (Neurips 2020)☆78Updated 3 years ago
- Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxi…☆68Updated 4 years ago
- Featurized Density Ratio Estimation☆20Updated 4 years ago
- ☆39Updated last year
- ☆31Updated 4 years ago
- Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…☆17Updated 5 years ago
- Pytorch implementation of neural processes and variants☆29Updated last year
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆69Updated 3 years ago