Python bindings for ggml
☆148 · Sep 2, 2024 · Updated last year
Alternatives and similar repositories for ggml-python
Users who are interested in ggml-python are comparing it to the libraries listed below.
- CLIP inference in plain C/C++ with no extra dependencies ☆552 · Jun 19, 2025 · Updated 8 months ago
- GGUF parser in Python ☆28 · Aug 15, 2024 · Updated last year
- TLS & API keys for your LLM APIs ☆20 · Dec 17, 2025 · Updated 2 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,882 · Jan 28, 2024 · Updated 2 years ago
- Python bindings for llama.cpp ☆10,020 · Aug 15, 2025 · Updated 6 months ago
- JAX implementation of GPTQ quantization algorithm ☆10 · Jul 19, 2023 · Updated 2 years ago
- Tensor library for machine learning ☆14,152 · Feb 27, 2026 · Updated last week
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆310 · Apr 11, 2024 · Updated last year
- Falcon LLM ggml framework with CPU and GPU support ☆249 · Jan 22, 2024 · Updated 2 years ago
- Falcon7B + Falcon40B support, in branch falcon40b. Now all good and working, but the main action is now in https://github.com/cmp-nct/ggllm.cpp ☆10 · Sep 30, 2023 · Updated 2 years ago
- treelite runtime binding in Go ☆13 · Jul 31, 2024 · Updated last year
- zlib with the build system replaced by zig ☆15 · Apr 17, 2024 · Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs), allowing users to chat with LLM … ☆619 · Feb 17, 2025 · Updated last year
- A small logging proxy server for intercepting and logging code completion requests from copilot. ☆13 · May 5, 2023 · Updated 2 years ago
- Unstract's interface to LLMs, Embeddings and VectorDBs. ☆18 · Jul 23, 2024 · Updated last year
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat… ☆14 · May 1, 2024 · Updated last year
- A Datalog Framework for Python ☆16 · Mar 8, 2023 · Updated 2 years ago
- Web browser version of StarCoder.cpp ☆46 · Jul 30, 2023 · Updated 2 years ago
- Insert units and constants into source code, with GNU Emacs ☆15 · Aug 27, 2024 · Updated last year
- An Emacs plugin using baidu-translate-api ☆11 · Nov 30, 2021 · Updated 4 years ago
- A fork of llama3.c used to do some R&D on inferencing ☆22 · Dec 20, 2024 · Updated last year
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML) ☆568 · Aug 8, 2023 · Updated 2 years ago
- Local ML voice chat using high-end models. ☆184 · Dec 13, 2025 · Updated 2 months ago
- Demo Python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 · Nov 6, 2023 · Updated 2 years ago
- C++ implementation of Qwen-LM ☆617 · Dec 6, 2024 · Updated last year
- Re-implementation of local descriptor HardNet training in fasta2+kornia ☆21 · Apr 6, 2020 · Updated 5 years ago
- Inference TinyLlama models on ncnn ☆24 · Aug 15, 2023 · Updated 2 years ago
- C++ implementation for 💫StarCoder ☆459 · Sep 9, 2023 · Updated 2 years ago
- ☆596 · Aug 23, 2024 · Updated last year
- ☆1,279 · Oct 24, 2023 · Updated 2 years ago
- ☆23 · Jun 4, 2024 · Updated last year
- Example of using CoreML from C++ ☆24 · Jun 14, 2023 · Updated 2 years ago
- ModernBERT model optimized for Apple Neural Engine. ☆31 · Jan 10, 2025 · Updated last year
- An implementation of Deepmind's Promptbreeder. ☆22 · Dec 22, 2023 · Updated 2 years ago
- An experiment of trying out whisper.cpp for real-time speech-to-text ☆20 · Dec 25, 2022 · Updated 3 years ago
- ggml implementation of BERT ☆498 · Feb 23, 2024 · Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆304 · Jun 13, 2023 · Updated 2 years ago
- Inference Llama 2 in one file of pure C ☆19,213 · Aug 6, 2024 · Updated last year
- An experimental method JIT for CPython 3 ☆29 · May 18, 2016 · Updated 9 years ago