Train, tune, and infer Bamba model
β138May 15, 2026Updated last month
Alternatives and similar repositories for bamba
Users that are interested in bamba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FMS Model Optimizer is a framework for developing reduced precision neural network models.β21Jun 24, 2026Updated last week
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β14Jan 30, 2026Updated 5 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Modeβ¦β126Sep 13, 2024Updated last year
- β33May 26, 2024Updated 2 years ago
- OmegaViT (Ξ©ViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space modβ¦β15Jun 22, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competβ¦β18Aug 28, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradiβ¦β16Jun 22, 2026Updated last week
- β39Feb 26, 2024Updated 2 years ago
- PyTorch implementation of models from the Zamba2 series.β193Jan 23, 2025Updated last year
- β17Dec 19, 2024Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on β¦β16Sep 18, 2025Updated 9 months ago
- Code repo for efficient quantized MoE inference with mixture of low-rank compensatorsβ38Apr 14, 2025Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Modelβ57Dec 4, 2024Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activatedβ37Aug 14, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale β¦β20Oct 13, 2025Updated 8 months ago
- Quantized Attention on GPUβ44Nov 22, 2024Updated last year
- β52Jan 28, 2024Updated 2 years ago
- Python toolsβ14Oct 22, 2023Updated 2 years ago
- Triton implement of bi-directional (non-causal) linear attentionβ76Mar 1, 2026Updated 4 months ago
- HGRN2: Gated Linear RNNs with State Expansionβ57Aug 20, 2024Updated last year
- Cray-LM unified training and inference stack.β22Jan 30, 2025Updated last year
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"β69Apr 11, 2025Updated last year
- β139Jun 6, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A swarm of LLM agents that will help you test, document, and productionize your code!β19Jun 22, 2026Updated last week
- β25Dec 13, 2024Updated last year
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inferenceβ58Nov 20, 2024Updated last year
- β20Dec 24, 2024Updated last year
- β48Nov 10, 2023Updated 2 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Modelsβ35Jun 12, 2024Updated 2 years ago
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.β13Jul 27, 2025Updated 11 months ago
- β157Mar 4, 2025Updated last year
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.β89Mar 27, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β138Feb 4, 2026Updated 5 months ago
- β212Dec 11, 2024Updated last year
- A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. Itβ¦β44Jan 25, 2025Updated last year
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"β22Oct 14, 2025Updated 8 months ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.β12Jun 19, 2026Updated 2 weeks ago
- β30Aug 21, 2025Updated 10 months ago
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)β32Apr 9, 2025Updated last year