Thin wrapper around GGML to make life easier
☆45Nov 5, 2025Updated 6 months ago
Alternatives and similar repositories for ggml-easy
Users that are interested in ggml-easy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Find out why your CoreML model isn't running on the Neural Engine!☆30Jun 18, 2024Updated last year
- An example of distributed tracing an MCP enabled agent☆15Feb 14, 2026Updated 3 months ago
- Persistent Kernel + JIT-Injected Operators (CUDA)☆47Jan 27, 2026Updated 4 months ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- Markerless AR using OpenCV and OpenGL☆15Nov 16, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FastRTC voice agent☆22Mar 18, 2025Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆14Apr 6, 2025Updated last year
- Cortex-M3 development tree☆15Jan 4, 2015Updated 11 years ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- Sample benchmark demonstrating the VK_NV_cooperative_vector extension☆15Dec 22, 2025Updated 5 months ago
- MiDeCon: Minutia Detection Confidence for Unsupervised and Accurate Minutia and Fingerprint Quality Assessment☆24Mar 18, 2024Updated 2 years ago
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 8 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- ☆20Oct 5, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Mar 12, 2024Updated 2 years ago
- handle gguf files☆13Aug 14, 2025Updated 9 months ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- Community bot☆12Feb 25, 2023Updated 3 years ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆43Aug 3, 2025Updated 9 months ago
- Huggingface deployment for FastHTML☆35Sep 13, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Mar 30, 2023Updated 3 years ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆88Updated this week
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆63Sep 17, 2025Updated 8 months ago
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 4 months ago
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…☆15Apr 23, 2026Updated last month
- An in-process trace collector using the Rust tracing framework and the Perfetto C++ SDK☆16Mar 2, 2026Updated 2 months ago
- ☆63Jul 10, 2025Updated 10 months ago
- (elastic) cuckoo hashing☆17Jun 20, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated last month
- ☆10Dec 8, 2021Updated 4 years ago
- Official Implementation of Deep Image Fingerprint: Accurate And Low Budget Synthetic Image Detector☆22Aug 31, 2023Updated 2 years ago
- ☆14Dec 21, 2025Updated 5 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Feb 18, 2025Updated last year
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆40Jan 5, 2025Updated last year
- An SDK for developing Webex Assistant Skills based on the MindMeld platform.☆13Feb 8, 2023Updated 3 years ago