GGUF implementation in C as a library and a tools CLI program
☆336May 16, 2026Updated 3 weeks ago
Alternatives and similar repositories for gguf-tools
Users that are interested in gguf-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Some random tools for working with the GGUF file format☆32Nov 24, 2023Updated 2 years ago
- A fork of llama3.c used to do some R&D on inferencing☆22Dec 20, 2024Updated last year
- ggml implementation of BERT☆500Feb 23, 2024Updated 2 years ago
- GGUF parser in Python☆29May 1, 2026Updated last month
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Node.js module providing inference APIs for large language models, with simple CLI.☆25Dec 7, 2024Updated last year
- CLIP inference in plain C/C++ with no extra dependencies☆560Jun 19, 2025Updated 11 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Feb 21, 2024Updated 2 years ago
- Recreation of the BBC News Map that allows for quick selection of counties and towns☆23Oct 19, 2011Updated 14 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆864Nov 16, 2024Updated last year
- Local ML voice chat using high-end models.☆188Jun 4, 2026Updated last week
- Tensor library for machine learning☆14,804Updated this week
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆314Apr 11, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Cross-platform binary launcher with Cosmopolitan libc☆35Apr 12, 2025Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,572Mar 23, 2025Updated last year
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆6,197Jun 7, 2026Updated last week
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- Fixed-point scalar and matrix multiplication library for SectorLISP☆15Jan 23, 2022Updated 4 years ago
- ☆129Jan 22, 2024Updated 2 years ago
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40Updated this week
- A new city of code on a cosmopolitan foundation.☆21Mar 19, 2021Updated 5 years ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,933Jun 4, 2026Updated last week
- Inference Llama 2 in one file of pure C☆19,604Aug 6, 2024Updated last year
- FlashAttention (Metal Port)☆605Sep 22, 2024Updated last year
- tsellm: LLMs in SQLite and DuckDB☆26Apr 21, 2025Updated last year
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆1,114Jun 1, 2026Updated last week
- ☆67Aug 19, 2024Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 5 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 2 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Example of using SDL2 with Cosmopolitan Libc☆39Mar 20, 2024Updated 2 years ago
- ☆17Apr 29, 2024Updated 2 years ago