SJTU-DENG-Lab / AdaMoE
[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models
20 · Oct 2, 2024 · Updated last year

Alternatives and similar repositories for AdaMoE

Users that are interested in AdaMoE are comparing it to the libraries listed below.
