GGUF implementation in C as a library and a tools CLI program
☆327May 16, 2026Updated last week
Alternatives and similar repositories for gguf-tools
Users that are interested in gguf-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fork of llama3.c used to do some R&D on inferencing☆22Dec 20, 2024Updated last year
- ggml implementation of BERT☆500Feb 23, 2024Updated 2 years ago
- First token cutoff sampling inference example☆30Jan 15, 2024Updated 2 years ago
- GGUF parser in Python☆28May 1, 2026Updated 3 weeks ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Node.js module providing inference APIs for large language models, with simple CLI.☆24Dec 7, 2024Updated last year
- GGUF parser for Go☆14Mar 8, 2026Updated 2 months ago
- CLIP inference in plain C/C++ with no extra dependencies☆558Jun 19, 2025Updated 11 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆86Feb 21, 2024Updated 2 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆861Nov 16, 2024Updated last year
- Local ML voice chat using high-end models.☆187Apr 3, 2026Updated last month
- HC-256 Stream cipher in x86 assembly☆19Nov 14, 2017Updated 8 years ago
- Tensor library for machine learning☆14,675Updated this week
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆51Oct 30, 2023Updated 2 years ago
- Julia interface to the 🤗 Hub☆19Apr 24, 2026Updated last month
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆314Apr 11, 2024Updated 2 years ago
- Cross-platform binary launcher with Cosmopolitan libc☆34Apr 12, 2025Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,569Mar 23, 2025Updated last year
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆6,080Updated this week
- A collection of some lockfree datastructures☆80Apr 20, 2023Updated 3 years ago
- ☆129Jan 22, 2024Updated 2 years ago
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- run ollama & gguf easily with a single command☆52May 15, 2024Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40Updated this week
- A new city of code on a cosmopolitan foundation.☆21Mar 19, 2021Updated 5 years ago
- Inference Llama 2 in one file of pure C☆19,548Aug 6, 2024Updated last year
- FlashAttention (Metal Port)☆601Sep 22, 2024Updated last year
- tsellm: LLMs in SQLite and DuckDB☆26Apr 21, 2025Updated last year
- ☆66Aug 19, 2024Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 4 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated last month
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- ☆17Apr 29, 2024Updated 2 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Jul 26, 2023Updated 2 years ago
- Create a single-file cross-platform server within an executable ZIP, powered by redbean 🦞☆18Dec 5, 2023Updated 2 years ago