Modified Mamba code to run on CPU
☆30Jan 14, 2024Updated 2 years ago
Alternatives and similar repositories for mamba-cpu
Users that are interested in mamba-cpu are comparing it to the libraries listed below
Sorting:
- Inference of Mamba and Mamba2 models in pure C☆197Jan 22, 2026Updated last month
- Dynamically controllable Llama-model LLM inference in macOS with MLX☆18Feb 8, 2025Updated last year
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 4 months ago
- LLM inference in Fortran☆64May 30, 2024Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Feb 6, 2026Updated last month
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated 2 weeks ago
- LCM Drawing app☆12Dec 1, 2023Updated 2 years ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Updated this week
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41May 24, 2024Updated last year
- manim practice, using manim ce☆11Mar 3, 2022Updated 4 years ago
- Kotlin library for Cortex.cpp a Local AI API Platform that is used to run and customize LLMs.☆10Apr 2, 2025Updated 11 months ago
- Raspberry Pi 4 Image☆12Oct 25, 2024Updated last year
- ☆17Dec 26, 2025Updated 2 months ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated last month
- rdiv!(::AbstractMatrix, ::UpperTriangular) and ldiv!(::LowerTriangular, ::AbstractMatrix)☆12Nov 18, 2024Updated last year
- sherpa-onnx Go package for Windows☆13Feb 28, 2026Updated last week
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 3 weeks ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 7 months ago
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Updated this week
- Yet another `llama.cpp` Rust wrapper☆12Jun 19, 2024Updated last year
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Use GSI's geoid model of Japan (JPGEO2024, GSIGEO2011) in Rust, Python and JavaScript — 国土地理院の日本のジオイドモデルを用いてジオイド高を計算する Rust、Python、JavaSc…☆14Dec 8, 2025Updated 2 months ago
- ☆11Apr 25, 2024Updated last year
- ☆11Jun 25, 2024Updated last year
- AADL models for the Crazyflie UAV -- OMSCS Class CS7639☆11Mar 22, 2020Updated 5 years ago
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆23Dec 15, 2025Updated 2 months ago
- Minimal web client for chatting and roleplay with AI characters☆26Aug 21, 2025Updated 6 months ago
- The Lily programming language ⚜☆10Jan 4, 2026Updated 2 months ago
- ☆30Dec 12, 2025Updated 2 months ago
- Parallel Simulated annealing in GPU using CUDA (used for floorplanning problem)☆12Jun 4, 2020Updated 5 years ago
- Exploring better compiler architecture☆10Feb 24, 2026Updated last week
- A minimal CLI tool for piping anything into an LLM.☆18Jan 1, 2026Updated 2 months ago
- Keras like network builder for Chainer☆11Oct 22, 2017Updated 8 years ago
- Inference slice of marian for bergamot's tiny11 models. Faster to compile, and wield. Fewer model-archs than bergamot-translator.☆13Oct 24, 2024Updated last year