GGUF implementation in C as a library and a tools CLI program
☆311Aug 28, 2025Updated 6 months ago
Alternatives and similar repositories for gguf-tools
Users that are interested in gguf-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A small utility library for parsing GGUF file info☆29Jan 27, 2025Updated last year
- Some random tools for working with the GGUF file format☆31Nov 24, 2023Updated 2 years ago
- A fork of llama3.c used to do some R&D on inferencing☆22Dec 20, 2024Updated last year
- ggml implementation of BERT☆497Feb 23, 2024Updated 2 years ago
- First token cutoff sampling inference example☆30Jan 15, 2024Updated 2 years ago
- GGUF parser in Python☆28Aug 15, 2024Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆199Updated this week
- GGUF parser for Go☆14Mar 8, 2026Updated 2 weeks ago
- CLIP inference in plain C/C++ with no extra dependencies☆554Jun 19, 2025Updated 9 months ago
- Port of Meta's Encodec in C/C++☆228Dec 4, 2024Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆86Feb 21, 2024Updated 2 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆857Nov 16, 2024Updated last year
- Recreation of the BBC News Map that allows for quick selection of counties and towns☆23Oct 19, 2011Updated 14 years ago
- Tensor library for machine learning☆14,252Mar 16, 2026Updated last week
- HC-256 Stream cipher in x86 assembly☆19Nov 14, 2017Updated 8 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Oct 30, 2023Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆310Apr 11, 2024Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,562Mar 23, 2025Updated last year
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆5,591Mar 16, 2026Updated last week
- Cross-platform binary launcher with Cosmopolitan libc☆34Apr 12, 2025Updated 11 months ago
- Fixed-point scalar and matrix multiplication library for SectorLISP☆15Jan 23, 2022Updated 4 years ago
- ☆128Jan 22, 2024Updated 2 years ago
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- asynchronous/distributed speculative evaluation for llama3☆39Aug 8, 2024Updated last year
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- A new city of code on a cosmopolitan foundation.☆21Mar 19, 2021Updated 5 years ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆1,016Dec 17, 2025Updated 3 months ago
- Inference Llama 2 in one file of pure C☆19,302Aug 6, 2024Updated last year
- FlashAttention (Metal Port)☆593Sep 22, 2024Updated last year
- ☆65Aug 19, 2024Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 7 months ago
- Implementation of ModernBERT in MLX☆20Jan 7, 2026Updated 2 months ago
- Temporary mail - Keep your real mailbox clean and secure. Temp Mail provides temporary, secure, anonymous, free, disposable email address…☆13Mar 17, 2023Updated 3 years ago
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆839Updated this week
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆118Feb 12, 2024Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago