Run LLaMA inference on CPU, with Rust 🦀🚀🦙
★34 · Jan 5, 2025 · Updated last year
Alternatives and similar repositories for llama-rs
Users that are interested in llama-rs are comparing it to the libraries listed below.
- Yet another `llama.cpp` Rust wrapper ★12 · Jun 19, 2024 · Updated last year
- One-Click RAG Implementation, Simple and Portable ★30 · Oct 5, 2025 · Updated 6 months ago
- Inference Llama3.2 1B/3B base/instruct models in 1 file of pure C ★22 · Jul 22, 2025 · Updated 8 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ★40 · Aug 2, 2023 · Updated 2 years ago
- LLama.cpp rust bindings ★419 · Jun 27, 2024 · Updated last year
- Solana Airdrop Faucet: A simple web application that allows users to receive free SOL tokens on the Solana Devnet. Built with Next.js, th… ★11 · Sep 22, 2024 · Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ★25 · Sep 1, 2025 · Updated 7 months ago
- Build a simple CMD chat interface with llama.cpp and C++ ★14 · Sep 19, 2025 · Updated 6 months ago
- Marvell Armada 38x U-Boot support ★10 · Mar 21, 2019 · Updated 7 years ago
- A thumbnail creation library ★10 · Apr 14, 2024 · Updated 2 years ago
- Non-blocking sockets wrapper ★17 · Aug 23, 2025 · Updated 7 months ago
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals. ★24 · Mar 26, 2026 · Updated 3 weeks ago
- "Unescapes" strings with escape sequences written with literal characters and converts it into a properly escaped one. ★11 · Mar 22, 2020 · Updated 6 years ago
- OpenVPN zero-copy parser written in pure rust ★21 · Sep 13, 2021 · Updated 4 years ago
- Rust+OpenCL+AVX2 implementation of LLaMA inference code ★555 · Feb 12, 2024 · Updated 2 years ago
- A library for Base85 encoding as described in RFC1924 ★11 · Mar 14, 2026 · Updated last month
- ★22 · Updated this week
- LIVA - Local Intelligent Voice Assistant ★61 · Aug 28, 2024 · Updated last year
- Ready-to-use widgets for the Floem GUI library on Windows, Mac and Linux. ★13 · Jan 28, 2024 · Updated 2 years ago
- Build a mobile cross-platform project template based on uni-app uni-app-matrix-admin ★10 · Nov 13, 2021 · Updated 4 years ago
- My own attempt at bringing the Chromium Embedded Framework to Rust via its C API ★26 · Aug 5, 2020 · Updated 5 years ago
- ★11 · Feb 27, 2023 · Updated 3 years ago