goombalab / phi-mamba

Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)
92Updated 4 months ago

Alternatives and similar repositories for phi-mamba:

Users that are interested in phi-mamba are comparing it to the libraries listed below