kroggen / mamba.c
View external linksLinks

Inference of Mamba and Mamba2 models in pure C

☆196

Alternatives and similar repositories for mamba.c

Users that are interested in mamba.c are comparing it to the libraries listed below

Sorting:

kroggen / mamba-cpu
View on GitHub
Modified Mamba code to run on CPU
☆30Jan 14, 2024Updated 2 years ago
iamlemec / bert.cpp
View on GitHub
GGML implementation of BERT model with Python bindings and quantization.
☆58Feb 19, 2024Updated last year
flawedmatrix / mamba-ssm
View on GitHub
Implementation of mamba with rust
☆92Mar 9, 2024Updated last year
LegallyCoder / mamba-hf
View on GitHub
Implementation of the Mamba SSM with hf_integration.
☆55Aug 31, 2024Updated last year
catid / spectral_ssm
View on GitHub
Implementation of Spectral State Space Models
☆16Feb 23, 2024Updated last year
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆180May 2, 2024Updated last year
redotvideo / mamba-chat
View on GitHub
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
☆940Mar 3, 2024Updated last year
Maximilian-Winter / llama-cpp-agent
View on GitHub
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …
☆614Feb 17, 2025Updated 11 months ago
ElleLeonne / Lightning-ReLoRA
View on GitHub
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆34Mar 2, 2024Updated last year
gigit0000 / qwen3.cu
View on GitHub
Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.
☆22Nov 26, 2025Updated 2 months ago
fishiatee / Tumera
View on GitHub
Yet another frontend for LLM, written using .NET and WinUI 3
☆10Sep 14, 2025Updated 5 months ago
okuvshynov / llama_duo
View on GitHub
asynchronous/distributed speculative evaluation for llama3
☆40Aug 8, 2024Updated last year
mgerstgrasser / tacheles
View on GitHub
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
☆28Jun 7, 2024Updated last year
staghado / vit.cpp
View on GitHub
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆306Apr 11, 2024Updated last year
jzhang38 / LongMamba
View on GitHub
Some preliminary explorations of Mamba's context scaling.
☆218Feb 8, 2024Updated 2 years ago
mscheong01 / speculative_decoding.c
View on GitHub
minimal C implementation of speculative decoding based on llama2.c
☆25Jul 15, 2024Updated last year
fairydreaming / tlcl
View on GitHub
Simple Tool Caller for llama.cpp
☆11Aug 12, 2024Updated last year
RWKV / rwkv.cpp
View on GitHub
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
☆1,562Mar 23, 2025Updated 10 months ago
Ejb503 / ai-voice-generation
View on GitHub
Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …
☆37Jun 12, 2024Updated last year
alxndrTL / othello_mamba
View on GitHub
Evaluating the Mamba architecture on the Othello game
☆49Apr 25, 2024Updated last year
kotoba-tech / kotomamba
View on GitHub
Mamba training library developed by kotoba technologies
☆71Feb 11, 2024Updated 2 years ago
ggml-org / p1
View on GitHub
LLM-based code completion engine
☆190Jan 23, 2025Updated last year
kyegomez / SimpleMamba
View on GitHub
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
☆40Nov 11, 2024Updated last year
Artefact2 / llm-eval
View on GitHub
A super simple web interface to perform blind tests on LLM outputs.
☆29Mar 9, 2024Updated last year
Oxen-AI / mamba-dive
View on GitHub
This is the code that went into our practical dive using mamba as information extraction
☆57Dec 22, 2023Updated 2 years ago
johnma2006 / mamba-minimal
View on GitHub
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
☆2,918Mar 8, 2024Updated last year
salykova / sgemm.c
View on GitHub
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
☆377Apr 21, 2025Updated 9 months ago
antirez / gguf-tools
View on GitHub
GGUF implementation in C as a library and a tools CLI program
☆303Aug 28, 2025Updated 5 months ago
distantmagic / structured
View on GitHub
Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp
☆45May 16, 2024Updated last year
atomlayer / llama_cute_voice_assistant
View on GitHub
Llama cute voice assistant
☆27Sep 10, 2023Updated 2 years ago
b4rtaz / distributed-llama
View on GitHub
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
☆2,822Feb 4, 2026Updated last week
pierrel55 / llama_st
View on GitHub
Load and run Llama from safetensors files in C
☆15Oct 24, 2024Updated last year
Cerebras / DocChat
View on GitHub
GPT-4 Level Conversational QA Trained In a Few Hours
☆65Aug 21, 2024Updated last year
KerfuffleV2 / smolrsrwkv
View on GitHub
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…
☆94Sep 2, 2023Updated 2 years ago
nanowell / Q-Sparse-LLM
View on GitHub
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆33Aug 14, 2024Updated last year
h9-tec / Qwen_MOE_C
View on GitHub
☆42Aug 2, 2025Updated 6 months ago
abetlen / program-constrained-language-model-sampling
View on GitHub
☆35Apr 8, 2023Updated 2 years ago
monatis / clip.cpp
View on GitHub
CLIP inference in plain C/C++ with no extra dependencies
☆552Jun 19, 2025Updated 7 months ago
kyegomez / MambaByte
View on GitHub
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
☆125Feb 6, 2026Updated last week

kroggen / mamba.cView external linksLinks

Alternatives and similar repositories for mamba.c

kroggen / mamba.c
View external linksLinks