Implementation of mamba with rust
☆94Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-ssm
Users that are interested in mamba-ssm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Apr 8, 2024Updated 2 years ago
- LLM inference in Fortran☆63May 30, 2024Updated last year
- win32 native frontend for llama-cli☆14Nov 2, 2024Updated last year
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆37Aug 14, 2024Updated last year
- ☆19Jan 3, 2024Updated 2 years ago
- GPT-2 small trained on phi-like data☆68Feb 18, 2024Updated 2 years ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- ☆70Mar 1, 2024Updated 2 years ago
- PanGenome Graph Building with the first 100 assemblies from the 1000G ONT Sequencing Consortium☆12Apr 5, 2025Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 8 months ago
- ☆30Feb 27, 2024Updated 2 years ago
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆41Feb 23, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- GPT-4 Level Conversational QA Trained In a Few Hours☆68Aug 21, 2024Updated last year
- Copy a bunch of files into your clipboard to provide context for LLMs☆113Feb 8, 2026Updated 3 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the Eleu…☆15Jul 24, 2023Updated 2 years ago
- ☆54Nov 22, 2024Updated last year
- Rust binding for WFA2-lib☆10Jun 7, 2022Updated 3 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆55Mar 25, 2025Updated last year
- ☆26Feb 26, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆44Jul 4, 2025Updated 10 months ago
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆107Oct 14, 2025Updated 7 months ago
- High-Performance Text Deduplication Toolkit☆61Aug 25, 2025Updated 9 months ago
- ☆12Apr 4, 2024Updated 2 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆21May 18, 2026Updated last week
- 更纯粹、更高压缩率的Tokenizer in Rust☆13Dec 21, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- BGUI stands for BOOPSI Graphical User Interface. BGUI is free GUI toolkit for the Amiga OS.☆11Apr 29, 2025Updated last year
- Constrained Decoding of Diffusion LLMs with Context-Free Grammars.☆48Dec 17, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A pure NumPy implementation of Mamba.☆222Jul 8, 2024Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 7 months ago
- ☆32Jan 7, 2024Updated 2 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- A lightweight chat terminal-interface for llama.cpp server written in C++ with many features and windows/linux support.☆27Mar 31, 2026Updated last month
- a functional parody of Stack Overflow, using AI☆10Apr 4, 2026Updated last month
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆940Mar 3, 2024Updated 2 years ago