LLaVA server (llama.cpp).
☆183Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for llava-cpp-server
Users that are interested in llava-cpp-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 生成训练文本检测数据集☆12Jul 1, 2020Updated 5 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Oct 30, 2023Updated 2 years ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend☆495Nov 28, 2023Updated 2 years ago
- Port of Suno AI's Bark in C/C++ for fast inference☆55Apr 15, 2024Updated 2 years ago
- ☆1,274Oct 24, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- CLIP inference in plain C/C++ with no extra dependencies☆558Jun 19, 2025Updated 10 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆858Nov 16, 2024Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆86Feb 21, 2024Updated 2 years ago
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb☆19Sep 15, 2025Updated 7 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆313Apr 11, 2024Updated 2 years ago
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 3 years ago
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆5,894Updated this week
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Nov 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Friendly Terminal Assistant for Developers☆17Mar 23, 2024Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆727Oct 11, 2023Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆630Mar 9, 2026Updated last month
- Python bindings for llama.cpp☆10,264Updated this week
- LLM-based code completion engine☆192Jan 23, 2025Updated last year
- ☆135Nov 24, 2023Updated 2 years ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45May 16, 2024Updated last year
- Web App to transcribe memos using Whisper AI.☆18Oct 23, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,917Apr 14, 2026Updated 2 weeks ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆69Dec 20, 2023Updated 2 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆15Feb 17, 2023Updated 3 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,886Jan 28, 2024Updated 2 years ago
- ☆15Sep 8, 2023Updated 2 years ago
- Visual Studio Code extension for WizardCoder☆148Aug 1, 2023Updated 2 years ago
- Make any person bald!! Component of the paper: Learning to regulate 3D head shape by removing occluding hair from in-the-wild images.☆12Jun 6, 2022Updated 3 years ago
- High accuracy code-switching whisper / qwen3 transcription☆28Apr 20, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- A super simple web interface to perform blind tests on LLM outputs.☆29Mar 9, 2024Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,511Mar 4, 2026Updated 2 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆63Apr 10, 2024Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆113Sep 10, 2024Updated last year
- ☆63Sep 23, 2024Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆85Aug 5, 2025Updated 9 months ago