PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
☆72Aug 22, 2023Updated 2 years ago
Alternatives and similar repositories for soft-moe
Users that are interested in soft-moe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆347Apr 2, 2025Updated last year
- ☆724Jun 6, 2026Updated last week
- ☆23Oct 22, 2025Updated 7 months ago
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated last year
- arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.