LLM inference in C/C++
☆87Apr 7, 2026Updated this week
Alternatives and similar repositories for llama.cpp
Users that are interested in llama.cpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30Aug 21, 2024Updated last year
- ☆18Nov 25, 2022Updated 3 years ago
- MacOS dragging helper☆12Mar 31, 2024Updated 2 years ago
- Took the neural style transfer for python and made it way more user friendly.☆15Dec 14, 2019Updated 6 years ago
- Concurrent data extraction from unstructured text and images using AI models.☆18Aug 10, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- A gRPC-Web implementation for Python☆14Jan 28, 2025Updated last year
- ☆13Mar 13, 2023Updated 3 years ago
- ☆16Feb 6, 2024Updated 2 years ago
- ☆12Mar 4, 2025Updated last year
- Remove duplicates from your Pocket list.☆16Jan 1, 2022Updated 4 years ago
- FLUX inspired Mann-E model☆14Oct 24, 2024Updated last year
- Exploring how optimizations for GEMMs work☆30Feb 28, 2026Updated last month
- With a few words and a click of a button, quickly get an engaging, high quality video. (And optionally save and share it!)☆19May 4, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An interactive Rust learning platform featuring progressive exercises aligned with "The Rust Programming Language" book.☆22Dec 8, 2025Updated 4 months ago
- ☆11Aug 10, 2021Updated 4 years ago
- [ACL 2025] NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering☆26Jul 29, 2025Updated 8 months ago
- An implementation of LLMzip using GPT-2☆13Aug 7, 2023Updated 2 years ago
- Telemetry otel instrumentation - Go, HTTP, Kubernetes☆16Feb 24, 2025Updated last year
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆24Sep 21, 2025Updated 6 months ago
- ☆18Jun 18, 2025Updated 9 months ago
- Cog wrapper for PASD Magnify☆17Jan 8, 2024Updated 2 years ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆27Dec 17, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Generate massive fake datasets for your datalake, fast. By SOMA☆20Mar 11, 2026Updated 3 weeks ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆61Dec 26, 2025Updated 3 months ago
- ComfyUI unofficial implementation of Thera - Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆36Jan 2, 2026Updated 3 months ago
- Various LLM Benchmarks☆24Feb 20, 2026Updated last month
- ☆19Apr 3, 2025Updated last year
- Generate a server from SQL☆22Mar 7, 2025Updated last year
- Бенчмарк для оценки способности языковых моделей решать математические и физические задачи на русском языке☆22Nov 14, 2025Updated 4 months ago
- A minimal clipboard history tool for macOS — inspired by Clipy, optimized for speed and simplicity.☆32Mar 31, 2026Updated last week
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆30Feb 19, 2025Updated last year
- Collaborative Culture Community Policy: Zero Tolerance☆25May 21, 2023Updated 2 years ago
- ☆35May 1, 2023Updated 2 years ago
- Habrahabr Enhancement Suite☆16Mar 6, 2020Updated 6 years ago
- Generate text with recurrent neural nets☆21Jul 23, 2017Updated 8 years ago
- T5-based (russian) text normalization☆26Jan 25, 2024Updated 2 years ago
- ☆49May 20, 2025Updated 10 months ago