VietHoang1512 / MT-SGD
Stochastic Multiple Target Sampling Gradient Descent (NeurIPS 2022)
☆13Updated 2 years ago
Alternatives and similar repositories for MT-SGD:
Users that are interested in MT-SGD are comparing it to the libraries listed below
- From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short term preferences of online us…Updated last year
- Distributional Sliced-Wasserstein distance code☆49Updated 8 months ago
- ☆8Updated 3 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆26Updated 2 years ago
- Global-Local Regularization Via Distributional Robustness (AISTATS 2023)☆9Updated last year
- ☆22Updated last year
- ☆38Updated 4 months ago
- A new mini-batch framework for optimal transport in deep generative models, deep domain adaptation, approximate Bayesian computation, col…☆37Updated 2 years ago
- Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)☆12Updated 3 years ago
- ☆34Updated last week
- Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop)☆42Updated 2 years ago
- ☆30Updated last year
- ☆44Updated 2 years ago
- Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style☆51Updated 3 years ago
- Causal Discovery via Bayesian Optimization (DrBO) - ICLR 2025☆10Updated last month
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- Bayesian Low-Rank Adaptation for Large Language Models☆30Updated 9 months ago
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Updated 2 years ago
- Continual Learning in Low-rank Orthogonal Subspaces (NeurIPS'20)☆36Updated 4 years ago
- Difference-of-Entropies (DoE) Estimator☆25Updated 2 years ago
- Code for "Supermasks in Superposition"☆121Updated last year
- Efficient Conditionally Invariant Representation Learning (ICLR 2023, Oral)☆21Updated 2 years ago
- Robust Learning with the Hilbert-Schmidt Independence Criterion☆45Updated 4 years ago
- [WACV 2024] Domain Generalisation via Risk Distribution Matching☆18Updated 6 months ago
- Weighted Training for Cross-Task Learning☆15Updated 2 years ago
- It is a repo which allows to compute all divergences derived from the theory of entropically regularized, unbalanced optimal transport. I…☆28Updated 2 years ago
- ☆10Updated 6 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆71Updated 10 months ago
- Official PyTorch Implementation for Conflict-Averse Gradient Descent (CAGrad)☆121Updated last year
- ☆14Updated 4 years ago