kyegomez / zetaLinks

Build high-performance AI models with modular building blocks

☆559

Alternatives and similar repositories for zeta

Users that are interested in zeta are comparing it to the libraries listed below

Sorting:

kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆192Updated last week
kyegomez / MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest…
☆458Updated 3 weeks ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆917Updated 6 months ago
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆576Updated 8 months ago
hkproj / mamba-notes
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
☆173Updated last year
alxndrTL / mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
☆1,353Updated 10 months ago
test-time-training / ttt-lm-jax
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆423Updated last year
kyegomez / MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
☆207Updated last week
Zyphra / BlackMamba
Code repository for Black Mamba
☆258Updated last year
tensorgi / TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
☆418Updated last week
syncdoth / RetNet
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…
☆226Updated last year
kyegomez / LFM
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆193Updated last week
kyegomez / Python-Package-Template
A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much muc…
☆187Updated last week
lucidrains / st-moe-pytorch
Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch
☆366Updated last year
srush / annotated-mamba
Annotated version of the Mamba paper
☆490Updated last year
radarFudan / Awesome-state-space-models
Collection of papers on state-space models
☆602Updated last month
AvivBick / awesome-ssm-ml
Reading list for research topics in state-space models
☆330Updated 4 months ago
goombalab / hnet
H-Net: Hierarchical Network with Dynamic Chunking
☆764Updated last month
test-time-training / ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆1,271Updated last year
apple / ml-sigmoid-attention
☆302Updated 6 months ago
PeaBrane / mamba-tiny
Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).
☆125Updated last year
HazyResearch / m2
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
☆560Updated 10 months ago
NVlabs / hymba
☆201Updated 10 months ago
pengzhangzhi / Awesome-Mamba
Awesome list of papers that extend Mamba to various applications.
☆138Updated 4 months ago
redotvideo / mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
☆933Updated last year
lucidrains / nGPT-pytorch
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
☆291Updated 4 months ago
dingo-actual / infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…
☆292Updated last year
lucidrains / native-sparse-attention-pytorch
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
☆775Updated 2 months ago
jxiw / MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆231Updated 2 weeks ago
Jamie-Stirling / RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
☆1,205Updated 2 years ago