Thin wrapper around GGML to make life easier
☆46Nov 5, 2025Updated 7 months ago
Alternatives and similar repositories for ggml-easy
Users that are interested in ggml-easy are comparing it to the libraries listed below.
- Share your GPU without MIG or MPS☆51Jan 27, 2026Updated 4 months ago
- Nim wrapper for Sandia-OpenSHMEM☆11Sep 9, 2022Updated 3 years ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- INTERVAL field for PostgreSQL (and an approximation for other backends)☆21Jul 27, 2023Updated 2 years ago
- Sample benchmark demonstrating the VK_NV_cooperative_vector extension☆15Dec 22, 2025Updated 5 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Mar 12, 2024Updated 2 years ago
- Manipulate Python Objects in Moonbit!☆33Jan 6, 2026Updated 5 months ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆44Aug 3, 2025Updated 10 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Mar 30, 2023Updated 3 years ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- Utility module for AWS CloudFront in Rust allowing you to create signed urls and cookies☆13Mar 14, 2025Updated last year
- Python 2.7 hashing and iteration in Python 3+☆18Nov 20, 2022Updated 3 years ago
- A deep learning intermediate representation for multi-platform compilation optimization☆10Oct 28, 2024Updated last year
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 5 months ago
- Self-contained Python lib with zero dependencies that gives you unified device properties for GPU, CPU, and NPU. No more calling separat…☆15Apr 23, 2026Updated last month
- DINOv2 inference engine written in C/C++ using ggml and OpenCV.☆96May 6, 2025Updated last year
- This is an Android app about monkey image classification☆11Jun 16, 2022Updated 4 years ago
- A Haskell port of the C++ smallpt path tracer☆16Dec 14, 2020Updated 5 years ago
- (elastic) cuckoo hashing☆17Jun 20, 2020Updated 5 years ago
- A Dockerfile and setup to run lollms-webui in a containerized environment☆19Sep 4, 2023Updated 2 years ago
- The documents for SRS☆17May 17, 2026Updated 3 weeks ago
- A bot that provides YouTube video chapters on Twitter (a.k.a. X)☆12Feb 5, 2025Updated last year
- ☆14Dec 21, 2025Updated 5 months ago
- ☆18Apr 7, 2026Updated 2 months ago
- ☆63Jul 10, 2025Updated 11 months ago
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- Real time streaming demo for testing whispercpp on Android☆12May 6, 2024Updated 2 years ago
- ☆40Mar 25, 2023Updated 3 years ago
- Course website for Systems Verification Fall 2024☆14Jul 10, 2025Updated 11 months ago
- Experiments with BitNet inference on CPU☆57Apr 1, 2024Updated 2 years ago
- Routing your gemini-cli to OpenAI third-party providers☆25Jul 21, 2025Updated 10 months ago
- A standalone CXL-enabled system simulator.☆21Apr 19, 2026Updated last month
- In-browser LLM website generator☆51Jan 28, 2025Updated last year