Pytorch (Lightning) implementation of the Mamba model
☆37Apr 18, 2025Updated last year
Alternatives and similar repositories for mamba
Users that are interested in mamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Retention-Network in PyTorch☆17Aug 12, 2023Updated 2 years ago
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆184Aug 12, 2024Updated last year
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21May 12, 2026Updated last week
- A simple and efficient Mamba implementation in pure PyTorch and MLX.☆1,460May 3, 2026Updated 3 weeks ago
- Code example for pretraining an LLM with vanilla PyTorch training loop☆10Jun 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ITMO AI Talent Hub Speech Recognition and Generation course☆13Apr 16, 2026Updated last month
- ☆21May 23, 2024Updated 2 years ago
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- Official Implementations "Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference" for DiT (NeurIPS'24)☆15Aug 3, 2025Updated 9 months ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- ☆22May 26, 2025Updated 11 months ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆126May 11, 2026Updated 2 weeks ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Jan 22, 2025Updated last year
- ☆13Jul 23, 2024Updated last year
- This is the code that went into our practical dive using mamba as information extraction☆57Dec 22, 2023Updated 2 years ago
- ☆10Jun 10, 2023Updated 2 years ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆29Mar 22, 2026Updated 2 months ago
- ☆10May 1, 2023Updated 3 years ago
- ☆12Dec 23, 2024Updated last year
- 免注册免费使用 ChatGPT,请关注微信公众号【胖竹同学】。☆10Apr 4, 2023Updated 3 years ago
- An automated feature engineering framework 'FETCH' accepted in ICLR 2023.☆11Jun 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Sep 22, 2025Updated 8 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25May 11, 2026Updated 2 weeks ago
- ☆12Apr 19, 2024Updated 2 years ago
- Minimal repository to demonstrate fast LoRA inference with Flux family of models.☆32Jul 23, 2025Updated 10 months ago
- ☆13Mar 25, 2023Updated 3 years ago
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]☆68Jun 19, 2025Updated 11 months ago
- [ICML 2024] PyTorch implementation for "Diversified Batch Selection for Training Acceleration"☆10Jul 30, 2024Updated last year
- [WACV 2026] CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading☆36Jan 21, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆131Aug 4, 2025Updated 9 months ago
- ZMQ-based framework for building Pub-Sub Systems, written in Python 3.☆15Aug 8, 2018Updated 7 years ago
- Associative scan package for DRYing some code between repos☆18Jan 5, 2026Updated 4 months ago
- ☆23Nov 23, 2025Updated 6 months ago
- The official PyTorch implementation of IEEE Transactions on Image Processing 2021 paper "Rethinking the U-shape Structure for Salient Obj…☆20Dec 1, 2022Updated 3 years ago
- Minimal implementation of TokenFormer for inference and learning☆13Nov 6, 2024Updated last year
- ☆15Aug 15, 2023Updated 2 years ago