ggml-org / ciLinks
CI for ggml and related projects
☆29Updated this week
Alternatives and similar repositories for ci
Users that are interested in ci are comparing it to the libraries listed below
Sorting:
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆26Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Thin wrapper around GGML to make life easier☆34Updated this week
- AirLLM 70B inference with single 4GB GPU☆13Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Web browser version of StarCoder.cpp☆45Updated last year
- Download full or partial git-lfs repos without temporarily using 2x disk space☆29Updated last year
- Rust crate for some audio utilities☆23Updated 2 months ago
- Training hybrid models for dummies.☆21Updated 4 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆40Updated this week
- Experiments with BitNet inference on CPU☆55Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 5 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆48Updated 3 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- First token cutoff sampling inference example☆29Updated last year
- run ollama & gguf easily with a single command☆50Updated last year
- Python bindings for ggml☆141Updated 9 months ago
- llama.cpp fork used by GPT4All☆55Updated 3 months ago
- Port of Facebook's LLaMA model in C/C++☆21Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆47Updated last year
- ☆19Updated 2 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Updated last year
- Mistral-7B finetuned for function calling☆16Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week