kyegomez / VortexFusionLinks
Transformers + Mambas + LSTMS All in One Model
☆14Updated last week
Alternatives and similar repositories for VortexFusion
Users that are interested in VortexFusion are comparing it to the libraries listed below
Sorting:
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Updated last year
- ☆50Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated last week
- A repository for DenseSSMs☆88Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated 2 years ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Updated last week
- We study toy models of skill learning.☆31Updated this week
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17Updated 8 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆39Updated last year
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆28Updated 5 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- State Space Models☆72Updated last year
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Updated 3 months ago
- ☆18Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Updated 10 months ago
- ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory☆45Updated 2 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆35Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆40Updated 2 years ago
- ☆13Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 9 months ago
- ☆17Updated 6 months ago
- Multimodal Graph Learning: how to encode multiple multimodal neighbors with their relations into LLMs☆67Updated last year
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- Implementation of Infini-Transformer in Pytorch☆112Updated last year
- When Reasoning Meets Its Laws☆35Updated last month
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆80Updated 6 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Updated last year
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Updated last week