GGUF implementation in C as a library and a tools CLI program
☆342May 16, 2026Updated last month
Alternatives and similar repositories for gguf-tools
Users that are interested in gguf-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A small utility library for parsing GGUF file info☆30Jan 27, 2025Updated last year
- ggml implementation of BERT☆501Feb 23, 2024Updated 2 years ago
- GGUF parser in Python☆29May 1, 2026Updated 2 months ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 3 months ago
- GGUF parser for Go☆14Mar 8, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CLIP inference in plain C/C++ with no extra dependencies☆563Jun 19, 2025Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Feb 21, 2024Updated 2 years ago
- Recreation of the BBC News Map that allows for quick selection of counties and towns☆23Oct 19, 2011Updated 14 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆866Nov 16, 2024Updated last year
- Local ML voice chat using high-end models.☆188Jun 4, 2026Updated last month
- Tensor library for machine learning☆14,871Jun 19, 2026Updated 2 weeks ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆51Oct 30, 2023Updated 2 years ago
- Julia interface to the 🤗 Hub☆19May 30, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆317Apr 11, 2024Updated 2 years ago
- Cross-platform binary launcher with Cosmopolitan libc☆35Apr 12, 2025Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,574Mar 23, 2025Updated last year
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆6,398Jun 26, 2026Updated last week
- Fixed-point scalar and matrix multiplication library for SectorLISP☆15Jan 23, 2022Updated 4 years ago
- ☆129Jan 22, 2024Updated 2 years ago
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆135Nov 9, 2024Updated last year
- A new city of code on a cosmopolitan foundation.☆21Mar 19, 2021Updated 5 years ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,971Jun 27, 2026Updated last week
- Inference Llama 2 in one file of pure C☆19,682Aug 6, 2024Updated last year
- FlashAttention (Metal Port)☆611Sep 22, 2024Updated last year
- tsellm: LLMs in SQLite and DuckDB☆26Apr 21, 2025Updated last year
- ☆67Aug 19, 2024Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 10 months ago
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Temporary mail - Keep your real mailbox clean and secure. Temp Mail provides temporary, secure, anonymous, free, disposable email address…☆12Mar 17, 2023Updated 3 years ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆121Feb 12, 2024Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 2 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- A chat UI for Llama.cpp☆16Jun 4, 2026Updated last month
- ☆17Apr 29, 2024Updated 2 years ago