james-oldfield / muMoE
[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
☆29 · Updated 6 months ago
Alternatives and similar repositories for muMoE:
Users interested in muMoE are comparing it to the repositories listed below.
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations ☆28 · Updated last year
- [BMVC 2022] Information Theoretic Representation Distillation ☆18 · Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections ☆19 · Updated 5 months ago
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24] ☆54 · Updated 3 months ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup ☆48 · Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". ☆16 · Updated 3 years ago
- This repo provides the codebase for "A General Framework for Weak Supervision" ☆34 · Updated 10 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions ☆28 · Updated 3 months ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C… ☆25 · Updated 3 years ago
- ☆13 · Updated 5 months ago
- Towards Unified and Effective Domain Generalization ☆30 · Updated last year
- [NeurIPS '24] Frustratingly easy Test-Time Adaptation of VLMs!! ☆41 · Updated last week
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆20 · Updated last year
- ☆31 · Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 · Updated 10 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆38 · Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆32 · Updated last year
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning" ☆10 · Updated 2 years ago
- Compress conventional Vision-Language Pre-training data ☆49 · Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders ☆58 · Updated last year
- Official Code for ICML 2023 Paper: On the Generalization of Multi-modal Contrastive Learning