kroggen / mamba.c
Inference of Mamba and Mamba2 models in pure C
☆196 · Jan 22, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for mamba.c
Users who are interested in mamba.c are comparing it to the libraries listed below.
- Modified Mamba code to run on CPU ☆30 · Jan 14, 2024 · Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆58 · Feb 19, 2024 · Updated last year
- Implementation of Mamba with Rust ☆92 · Mar 9, 2024 · Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆55 · Aug 31, 2024 · Updated last year
- Implementation of Spectral State Space Models ☆16 · Feb 23, 2024 · Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models ☆180 · May 2, 2024 · Updated last year
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍 ☆940 · Mar 3, 2024 · Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆614 · Feb 17, 2025 · Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆34 · Mar 2, 2024 · Updated last year
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies. ☆22 · Nov 26, 2025 · Updated 2 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆10 · Sep 14, 2025 · Updated 5 months ago
- asynchronous/distributed speculative evaluation for llama3 ☆40 · Aug 8, 2024 · Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 · Jun 7, 2024 · Updated last year
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆306 · Apr 11, 2024 · Updated last year
- Some preliminary explorations of Mamba's context scaling. ☆218 · Feb 8, 2024 · Updated 2 years ago
- minimal C implementation of speculative decoding based on llama2.c ☆25 · Jul 15, 2024 · Updated last year
- Simple Tool Caller for llama.cpp ☆11 · Aug 12, 2024 · Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model ☆1,562 · Mar 23, 2025 · Updated 10 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 · Jun 12, 2024 · Updated last year
- Evaluating the Mamba architecture on the Othello game ☆49 · Apr 25, 2024 · Updated last year
- Mamba training library developed by kotoba technologies ☆71 · Feb 11, 2024 · Updated 2 years ago
- LLM-based code completion engine ☆190 · Jan 23, 2025 · Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications ☆40 · Nov 11, 2024 · Updated last year
- A super simple web interface to perform blind tests on LLM outputs. ☆29 · Mar 9, 2024 · Updated last year
- This is the code that went into our practical dive using mamba as information extraction ☆57 · Dec 22, 2023 · Updated 2 years ago
- Simple, minimal implementation of the Mamba SSM in one file of PyTorch. ☆2,918 · Mar 8, 2024 · Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆377 · Apr 21, 2025 · Updated 9 months ago
- GGUF implementation in C as a library and a tools CLI program ☆303 · Aug 28, 2025 · Updated 5 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆45 · May 16, 2024 · Updated last year
- Llama cute voice assistant ☆27 · Sep 10, 2023 · Updated 2 years ago
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference. ☆2,822 · Feb 4, 2026 · Updated last week
- Load and run Llama from safetensors files in C ☆15 · Oct 24, 2024 · Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 · Aug 21, 2024 · Updated last year
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva… ☆94 · Sep 2, 2023 · Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆33 · Aug 14, 2024 · Updated last year
- ☆42 · Aug 2, 2025 · Updated 6 months ago
- ☆35 · Apr 8, 2023 · Updated 2 years ago
- CLIP inference in plain C/C++ with no extra dependencies ☆552 · Jun 19, 2025 · Updated 7 months ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in PyTorch and Zeta ☆125 · Feb 6, 2026 · Updated last week