goombalab / phi-mamba

Official implementation of Phi-Mamba, a MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models).
119 · Sep 13, 2024 · Updated last year

Alternatives and similar repositories for phi-mamba

Users that are interested in phi-mamba are comparing it to the libraries listed below
