microsoft / SparseMixer
Sparse Backpropagation for Mixture-of-Expert Training
29 · Jul 2, 2024 · Updated last year

Alternatives and similar repositories for SparseMixer

Users who are interested in SparseMixer are comparing it to the libraries listed below
