EfficientMoE / MoE-Infinity

PyTorch library for cost-effective, fast and easy serving of MoE models.
165Updated 2 weeks ago

Alternatives and similar repositories for MoE-Infinity:

Users that are interested in MoE-Infinity are comparing it to the libraries listed below