james-oldfield/muMoE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/james-oldfield/muMoE)

james-oldfield / muMoE

[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

☆41

Alternatives and similar repositories for muMoE

Users that are interested in muMoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

james-oldfield / MxD
View on GitHub
[NeurIPS'25] Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
☆16May 28, 2025Updated last year
roymiles / VeLoRA
View on GitHub
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆22Oct 15, 2024Updated last year
NickyFot / ACMMM22_LearningLabelRelationships
View on GitHub
☆11Jun 20, 2023Updated 3 years ago
james-oldfield / PandA
View on GitHub
[ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs"
☆58Jun 8, 2023Updated 3 years ago
zengqunzhao / AIM-Fair
View on GitHub
[CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
☆18Mar 27, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ChuanyangZheng / L2ViT
View on GitHub
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
☆15Sep 7, 2024Updated last year
KingJamesSong / latent-flow
View on GitHub
NeurIPS23 "Flow Factorized Representation Learning"
☆46Dec 15, 2025Updated 7 months ago
MrChenFeng / Adaptive-Soft-Contrastive-Learning_ICPR2022
View on GitHub
ASCL: adpative Soft Contrastive Learning (ICPR2022)
☆22Mar 22, 2025Updated last year
alexandrosXe / A-Simple-Baseline-For-Knowledge-Based-VQA
View on GitHub
Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering"
☆25Dec 14, 2023Updated 2 years ago
kostas1515 / GOL
View on GitHub
[ECCV2022] Gumbel Optimised Loss for Long Tailed Instance Segmentation.
☆18Nov 24, 2022Updated 3 years ago
KingJamesSong / RankFeat
View on GitHub
NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension
☆20Feb 21, 2025Updated last year
NickyFot / HitchhikersGuide
View on GitHub
Official repository of "A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning" published in NeurIPS'20…
☆12Feb 7, 2025Updated last year
roymiles / Simple-Recipe-Distillation
View on GitHub
[AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation
☆20Feb 13, 2024Updated 2 years ago
chi0tzp / ContraCLIP
View on GitHub
Authors official PyTorch implementation of the "ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences".
☆42Oct 1, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
chi0tzp / FALCO
View on GitHub
[CVPR 2023, top-10%] Authors official PyTorch implementation of the "Attribute-preserving Face Dataset Anonymization via Latent Code Opti…
☆77Aug 8, 2024Updated last year
nikosips / UDON
View on GitHub
☆11Nov 18, 2024Updated last year
roymiles / VkD
View on GitHub
[CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections
☆59Oct 21, 2024Updated last year
roymiles / ITRD
View on GitHub
[BMVC 2022] Information Theoretic Representation Distillation
☆19Oct 6, 2023Updated 2 years ago
WalterSimoncini / no-train-all-gain
View on GitHub
Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"
☆11Oct 31, 2024Updated last year
aradha / deep_neural_feature_ansatz
View on GitHub
Code for verifying deep neural feature ansatz
☆22May 3, 2023Updated 3 years ago
WangYZ1608 / Knowledge-Distillation-via-ND
View on GitHub
The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction
☆24Aug 3, 2023Updated 2 years ago
autodriving-heart / Awesome-4D-Radar
View on GitHub
Awesome-4D-Radar
☆12Feb 17, 2024Updated 2 years ago
nubot-nudt / UGNA-VPR
View on GitHub
☆12Mar 28, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Maryeon / whiten_mtd
View on GitHub
Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"
☆11Dec 20, 2023Updated 2 years ago
YuCao16 / CRDI
View on GitHub
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
☆16Mar 14, 2025Updated last year
wookiekim / HCCNet
View on GitHub
Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…
☆11Apr 29, 2024Updated 2 years ago
Newbeeer / orthogonal_classifier
View on GitHub
Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier"
☆35Jun 6, 2023Updated 3 years ago
kmk97 / Fast-Loc-NeRF
View on GitHub
[ICRA 2025] Fast Global Localization on Neural Radiance Field
☆17Jun 30, 2025Updated last year
BinuxLiu / NocPlace
View on GitHub
Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer
☆14Dec 3, 2024Updated last year
ai4ce / NYC-Indoor-VPR
View on GitHub
☆16Dec 18, 2025Updated 7 months ago
chi0tzp / PyVGGFace
View on GitHub
VGG-Face CNN descriptor in PyTorch.
☆38Jan 21, 2021Updated 5 years ago
NBoulle / RationalNets
View on GitHub
Code for the paper "Rational neural networks", NeurIPS 2020
☆30Feb 15, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mingzeG / DropCov
View on GitHub
Implementation of DropCov as described in DropCov: A Simple yet Effective Method for Improving Deep Architectures
☆10Oct 15, 2022Updated 3 years ago
google-research / silc
View on GitHub
[ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation
☆48Oct 3, 2024Updated last year
jlevy44 / PathFlow-MixMatch
View on GitHub
Don't mix, match! Simple utilities for improved registration of Histopathology Whole Slide Images.
☆11Oct 11, 2020Updated 5 years ago
amaralibey / GPM
View on GitHub
Official repository for BMVC 2022 paper: Global Proxy-based Hard Mining for Visual Place Recognition
☆18Mar 7, 2023Updated 3 years ago
bramtoula / vdna
View on GitHub
Pytorch implementation of Visual DNA, an approach to represent and compare images.
☆41Feb 14, 2024Updated 2 years ago
Westlake-AI / A2MIM
View on GitHub
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
☆32Aug 15, 2024Updated last year
moukamisama / Recon
View on GitHub
☆12Apr 18, 2023Updated 3 years ago