Taishi-N324 / Drop-Upcycling
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
25 · Oct 5, 2025 · Updated 5 months ago

Alternatives and similar repositories for Drop-Upcycling

Users who are interested in Drop-Upcycling are comparing it to the libraries listed below.
