Train, tune, and infer Bamba model
★138 · May 15, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for bamba
Users that are interested in bamba are comparing it to the libraries listed below.
- FMS Model Optimizer is a framework for developing reduced precision neural network models. ★21 · May 28, 2026 · Updated 2 weeks ago
- Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ★14 · Jan 30, 2026 · Updated 4 months ago
- ★33 · May 26, 2024 · Updated 2 years ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod… ★14 · Updated this week
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet… ★18 · Aug 28, 2024 · Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi… ★16 · May 25, 2026 · Updated 2 weeks ago
- ★39 · Feb 26, 2024 · Updated 2 years ago
- PyTorch implementation of models from the Zamba2 series. ★193 · Jan 23, 2025 · Updated last year
- ★17 · Dec 19, 2024 · Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on … ★16 · Sep 18, 2025 · Updated 8 months ago
- Code repo for efficient quantized MoE inference with mixture of low-rank compensators ★36 · Apr 14, 2025 · Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model ★57 · Dec 4, 2024 · Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ★37 · Aug 14, 2024 · Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ★20 · Oct 13, 2025 · Updated 8 months ago
- Quantized Attention on GPU ★44 · Nov 22, 2024 · Updated last year
- ★52 · Jan 28, 2024 · Updated 2 years ago
- Python tools ★14 · Oct 22, 2023 · Updated 2 years ago
- Triton implement of bi-directional (non-causal) linear attention ★76 · Mar 1, 2026 · Updated 3 months ago
- HGRN2: Gated Linear RNNs with State Expansion ★57 · Aug 20, 2024 · Updated last year
- langchain opentutorial utility package for Open Tutorial ★10 · Feb 2, 2025 · Updated last year
- See vLLM official support: https://github.com/vllm-project/vllm-ascend ★11 · Feb 5, 2025 · Updated last year
- ★137 · Jun 6, 2025 · Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code! ★19 · Updated this week
- ★26 · Dec 13, 2024 · Updated last year
- ANE accelerated embedding models! ★19 · Dec 11, 2024 · Updated last year
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference ★59 · Nov 20, 2024 · Updated last year
- ★20 · Dec 24, 2024 · Updated last year
- ★48 · Nov 10, 2023 · Updated 2 years ago
- Google Research ★47 · Oct 29, 2022 · Updated 3 years ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models ★35 · Jun 12, 2024 · Updated 2 years ago
- GRadient-INformed MoE ★264 · Sep 25, 2024 · Updated last year
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform. ★13 · Jul 27, 2025 · Updated 10 months ago
- ★157 · Mar 4, 2025 · Updated last year
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. ★89 · Mar 27, 2026 · Updated 2 months ago
- ★135 · Feb 4, 2026 · Updated 4 months ago
- ★212 · Dec 11, 2024 · Updated last year
- A specialized RWKV-7 model for Othello(a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It… ★44 · Jan 25, 2025 · Updated last year
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters" ★22 · Oct 14, 2025 · Updated 8 months ago
- ★30 · Aug 21, 2025 · Updated 9 months ago