ggml-org / ciLinks
CI for ggml and related projects
☆30Updated 4 months ago
Alternatives and similar repositories for ci
Users that are interested in ci are comparing it to the libraries listed below
Sorting:
- LLM-based code completion engine☆190Updated last year
- Transformer GPU VRAM estimator☆68Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- llama.cpp fork used by GPT4All☆55Updated 11 months ago
- Command line tool for Deep Infra cloud ML inference service☆34Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆51Updated 11 months ago
- LLM powered development for IntelliJ☆84Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆170Updated 9 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆203Updated 4 months ago
- LLM inference in C/C++☆104Updated last week
- The official Python library for Formulaic☆18Updated last year
- Granite 3.1 Language Models☆137Updated 7 months ago
- ☆166Updated 6 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago
- AMD related optimizations for transformer models☆97Updated 3 months ago
- 🔊 We believe in a future where developers are amplified, not automated☆117Updated 4 months ago
- ☆60Updated last week
- GGML implementation of BERT model with Python bindings and quantization.☆26Updated 2 years ago
- Examples of calling OpenRouter models from Python code☆88Updated 9 months ago
- C API for MLX☆172Updated last week
- MLX support for the Open Neural Network Exchange (ONNX)☆63Updated last year
- ☆85Updated 2 months ago
- 🏥 Health monitor for a Petals swarm☆40Updated last year
- ☆68Updated last year
- ☆34Updated 9 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- Python bindings for ggml☆147Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆243Updated last year
- ☆172Updated 11 months ago