james-oldfield / muMoE

[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
25 · Updated last month

Related projects

Alternatives and complementary repositories for muMoE