hyhuang00 / moe_inference

Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".
16Updated 5 months ago

Alternatives and similar repositories for moe_inference:

Users that are interested in moe_inference are comparing it to the libraries listed below