Thin wrapper around GGML to make life easier
☆45Nov 5, 2025Updated 5 months ago
Alternatives and similar repositories for ggml-easy
Users that are interested in ggml-easy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An example of distributed tracing an MCP enabled agent☆15Feb 14, 2026Updated 2 months ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration.☆42Oct 6, 2025Updated 6 months ago
- Markerless AR using OpenCV and OpenGL☆15Nov 16, 2020Updated 5 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆14Apr 6, 2025Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- Gradio Client in Rust.☆29Apr 8, 2026Updated last week
- MiDeCon: Minutia Detection Confidence for Unsupervised and Accurate Minutia and Fingerprint Quality Assessment☆24Mar 18, 2024Updated 2 years ago
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 7 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆42Aug 3, 2025Updated 8 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Mar 30, 2023Updated 3 years ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 6 months ago
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 3 months ago
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…☆14Mar 30, 2026Updated 2 weeks ago
- A Haskell port of the C++ smallpt path tracer☆16Dec 14, 2020Updated 5 years ago
- (elastic) cuckoo hashing☆16Jun 20, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast and memory-efficient exact attention☆20Updated this week
- ☆10Dec 8, 2021Updated 4 years ago
- ☆14Dec 21, 2025Updated 3 months ago
- Official Implementation of Deep Image Fingerprint: Accurate And Low Budget Synthetic Image Detector☆22Aug 31, 2023Updated 2 years ago
- A bot that provides Youtube vid chapters on Twitter (a.k.a. X )☆12Feb 5, 2025Updated last year
- A lightweight Python library for running TTS models with a unified API.☆21Feb 18, 2025Updated last year
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Get direct links from sharing links for files stored in Yandex.Disk☆14Aug 18, 2013Updated 12 years ago
- A C++ fork/rewrite of the smhasher project to bring Murmurhash v.3 to the Linux shell and to the PHP scripting language.☆21Jul 25, 2011Updated 14 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A CLI tool for managing your locally downloaded Huggingface models and datasets☆35Aug 19, 2025Updated 7 months ago
- Course website for Systems Verification Fall 2024☆14Jul 10, 2025Updated 9 months ago
- ☆40Mar 25, 2023Updated 3 years ago
- This package provides Swift bindings for llama.cpp☆26Apr 4, 2023Updated 3 years ago
- Experiments with BitNet inference on CPU☆56Apr 1, 2024Updated 2 years ago
- A standalone CXL-enabled system simulator.☆21Jan 10, 2026Updated 3 months ago
- In-browser LLM website generator☆51Jan 28, 2025Updated last year