ggml-org / ci
CI for ggml and related projects
☆22Updated this week
Alternatives and similar repositories for ci:
Users that are interested in ci are comparing it to the libraries listed below
- AirLLM 70B inference with single 4GB GPU☆12Updated 5 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆52Updated 11 months ago
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆26Updated last week
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated 11 months ago
- Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…☆35Updated 8 months ago
- Simple, fast, parallel Hugging Face GGML model downloader written in Python☆24Updated last year
- Web browser version of StarCoder.cpp☆43Updated last year
- Download full or partial git-lfs repos without temporarily using 2x disk space☆30Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 6 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 2 weeks ago
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Command line tool for Deep Infra cloud ML inference service☆26Updated 7 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- Convert Python code into JSON consumable by OpenAI's function API.☆25Updated last year
- GRDN.AI app for garden optimization☆70Updated 11 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆27Updated 10 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆42Updated 5 months ago
- A Qt GUI for large language models☆27Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆37Updated last month
- Grok by X (Twitter) System Prompt Leak☆25Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆43Updated 3 months ago
- Efficiently computing & storing token n-grams from large corpora☆17Updated 3 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆76Updated last month
- GitHub repo for Peifeng's internship project☆13Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299☆20Updated 6 months ago