kyegomez / VortexFusionLinks
Transformers + Mambas + LSTMS All in One Model
☆12Updated 2 weeks ago
Alternatives and similar repositories for VortexFusion
Users that are interested in VortexFusion are comparing it to the libraries listed below
Sorting:
- ☆18Updated 11 months ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated last week
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆26Updated 2 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆13Updated 11 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆108Updated last week
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆110Updated 2 weeks ago
- State Space Models