agwaBom / towards_moe
Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022
☆10Updated 2 years ago
Alternatives and similar repositories for towards_moe:
Users that are interested in towards_moe are comparing it to the libraries listed below
- Official repository for Fourier model that can generate periodic signals☆10Updated 3 years ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated last year
- LISA for ICML 2022☆48Updated 2 years ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)☆41Updated last year
- ☆25Updated 11 months ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Updated 2 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆38Updated last year
- Official implementation for NeurIPS'23 paper "Geodesic Multi-Modal Mixup for Robust Fine-Tuning"☆33Updated 7 months ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated 2 years ago
- Source code for paper "Contrastive Out-of-Distribution Detection for Pretrained Transformers", EMNLP 2021☆40Updated 3 years ago
- ☆30Updated 9 months ago
- Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"☆71Updated last year
- ☆45Updated 2 years ago
- ☆28Updated last year
- ☆38Updated 3 years ago
- Crawl & visualize ICLR papers and reviews☆109Updated 2 years ago
- ☆20Updated 3 years ago
- Use this package to compute intrinsic dimensionality of your task given a fixed neural network in PYTORCH!☆35Updated 2 years ago
- ☆22Updated 10 months ago
- Weighted Training for Cross-Task Learning☆15Updated 2 years ago
- ☆38Updated 5 months ago
- Model Zoos for Continual Learning (ICLR 22)☆45Updated last year
- This repository is the official implementation of Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regulari…☆21Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆14Updated last year
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆50Updated 10 months ago
- Code for the ICLR 2022 paper "Attention-based interpretability with Concept Transformers"☆40Updated 2 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated 11 months ago
- ☆66Updated 3 years ago
- ☆17Updated last year