thu-ml / ReMoEView on GitHub
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
106Dec 20, 2024Updated last year

Alternatives and similar repositories for ReMoE

Users that are interested in ReMoE are comparing it to the libraries listed below

Sorting:

Are these results useful?