wrmedford / moe-scaling

Scaling Laws for Mixture of Experts Models
10Updated 2 months ago

Alternatives and similar repositories for moe-scaling

Users that are interested in moe-scaling are comparing it to the libraries listed below

Sorting: