kyegomez / VortexFusionLinks
Transformers + Mambas + LSTMS All in One Model
☆14Updated 3 weeks ago
Alternatives and similar repositories for VortexFusion
Users that are interested in VortexFusion are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated last week
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆26Updated 3 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆14Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 4 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆24Updated 3 weeks ago
- Implementation of Infini-Transformer in Pytorch☆113Updated 10 months ago
- ☆50Updated 9 months ago
- ☆18Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆114Updated last month
- ☆16Updated last year
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering☆27Updated last month
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- A repository for DenseSSMs☆89Updated last year
- Geometric-Mean Policy Optimization☆92Updated this week
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆122Updated last year
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆32Updated last year
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation, arXiv 2024☆64Updated last month
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆61Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆49Updated 6 months ago
- State Space Models☆71Updated last year
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆73Updated 3 months ago
- A RL env with procedurally generated symbolic reasoning data☆29Updated 3 weeks ago
- ☆67Updated 7 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆56Updated 3 weeks ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆16Updated 8 months ago
- ☆40Updated 5 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 5 months ago
- ☆51Updated 9 months ago