kyegomez / VortexFusion

Transformers + Mambas + LSTMs All in One Model

☆14

Alternatives and similar repositories for VortexFusion

Users who are interested in VortexFusion are comparing it to the libraries listed below.

kyegomez / TTL
Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"
☆25 Updated last week
tum-ai / number-token-loss
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
☆26 Updated 3 months ago
kyegomez / HSSS
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…
☆14 Updated last year
kyegomez / MultiQueryAttention
This is a simple torch implementation of the high performance Multi-Query Attention
☆15 Updated 2 years ago
VITA-Group / o1-planning
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
☆41 Updated 4 months ago
kyegomez / MC-ViT
Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"
☆24 Updated 3 weeks ago
lucidrains / infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
☆113 Updated 10 months ago
Weixin-Liang / Mixture-of-Mamba
☆50 Updated 9 months ago
radarFudan / mamba
☆18 Updated last year
kyegomez / MoE-Mamba
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…
☆114 Updated last month
yale-nlp / refdpo
☆16 Updated last year
chenzhongwu20 / RuleRAG_ICL_FT
RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering
☆27 Updated last month
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆19 Updated last year
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆37 Updated last year
WailordHe / DenseSSM
A repository for DenseSSMs
☆89 Updated last year
callsys / GMPO
Geometric-Mean Policy Optimization
☆92 Updated this week
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆122 Updated last year
uclaml / MoE
Towards Understanding the Mixture-of-Experts Layer in Deep Learning
☆32 Updated last year
SHI-Labs / VisPer-LM
[NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation, arXiv 2024
☆64 Updated last month
ByungKwanLee / Phantom
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆61 Updated last year
microsoft / x-reasoner
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains
☆49 Updated 6 months ago
badripatro / mamba360
State Space Models
☆71 Updated last year
gersteinlab / ChemAgent
[ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590
☆73 Updated 3 months ago
sileod / reasoning_core
A RL env with procedurally generated symbolic reasoning data
☆29 Updated 3 weeks ago
du-nlp-lab / MLR-Copilot
☆67 Updated 7 months ago
kyegomez / Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆56 Updated 3 weeks ago
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆16 Updated 8 months ago
metal-chart-generation / metal
☆40 Updated 5 months ago
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆57 Updated 5 months ago
LAMDASZ-ML / Self-Backtracking
☆51 Updated 9 months ago