VietHoang1512 / MT-SGD
Stochastic Multiple Target Sampling Gradient Descent (NeurIPS 2022)
☆13Updated 2 years ago
Alternatives and similar repositories for MT-SGD:
Users that are interested in MT-SGD are comparing it to the libraries listed below
- From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short term preferences of online us…☆0Updated 10 months ago
- ☆9Updated 3 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆26Updated 2 years ago
- Distributional Sliced-Wasserstein distance code☆49Updated 5 months ago
- Global-Local Regularization Via Distributional Robustness (AISTATS 2023)☆9Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆19Updated 3 years ago
- A new mini-batch framework for optimal transport in deep generative models, deep domain adaptation, approximate Bayesian computation, col…☆37Updated last year
- ☆38Updated 2 months ago
- ☆22Updated last year
- ☆108Updated 2 years ago
- Belief matching framework official implementation☆39Updated last year
- Code for Environment Inference for Invariant Learning (ICML 2021 Paper)☆50Updated 3 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- Robust Learning with the Hilbert-Schmidt Independence Criterion☆44Updated 4 years ago
- PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"☆74Updated 4 years ago
- Deep Learning & Information Bottleneck☆53Updated last year
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆68Updated 8 months ago
- ☆25Updated 8 months ago
- An implementation of Maximum Mean Discrepancy (MMD) as a differentiable loss in PyTorch.☆30Updated 2 years ago
- ☆43Updated 2 years ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc…☆147Updated 2 years ago
- Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop)☆42Updated 2 years ago
- ☆13Updated 4 years ago
- ☆30Updated last year
- Code used in "Understanding Dimensional Collapse in Contrastive Self-supervised Learning" paper.☆76Updated 2 years ago
- Example code of Sparse Gaussian Process Attention (ICLR 2023)☆22Updated 6 months ago
- A curated list of papers and resources about the distribution shift in machine learning.☆107Updated last year
- Code for NeurIPS 2020 Paper --- Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks☆21Updated 2 years ago
- Continual Learning in Low-rank Orthogonal Subspaces (NeurIPS'20)☆37Updated 4 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆54Updated last year