SJTU-DENG-Lab / AdaMoE
[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
20 · Oct 2, 2024 · Updated last year

Alternatives and similar repositories for AdaMoE

Users who are interested in AdaMoE are comparing it to the libraries listed below.

