Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!
☆29Dec 11, 2025Updated 2 months ago
Alternatives and similar repositories for llama-cpp-connector
Users that are interested in llama-cpp-connector are comparing it to the libraries listed below
Sorting:
- ☆15Apr 9, 2025Updated 10 months ago
- ☆13Mar 10, 2025Updated 11 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆23Apr 1, 2025Updated 11 months ago
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated last week
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 2 weeks ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 11 months ago
- A FastAPI application that integrates with Telegram using webhooks and OpenAI Agents SDK for AI-powered stock trading assistance, utilizi…☆16May 11, 2025Updated 9 months ago
- Unified API platform for free access to enterprise-grade AI models from Google, Groq, and OpenRouter. Industrial-ready integration with h…☆13Mar 14, 2025Updated 11 months ago
- Moondream MCP Server in Python☆44Jul 2, 2025Updated 7 months ago
- A simple CLI app which allows you to generate and deploy simple apps. MVP.☆21Aug 4, 2025Updated 6 months ago
- Open source Speechify alternative. Read PDFs and EPUBs with local models.☆37Nov 14, 2025Updated 3 months ago
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- An AI assistant building SDK in python☆43Sep 21, 2025Updated 5 months ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Feb 27, 2025Updated last year
- ☆19Jul 4, 2025Updated 7 months ago
- a browser gui for nvidia smi☆20Mar 17, 2025Updated 11 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 10 months ago
- ☆17Dec 16, 2024Updated last year
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21May 1, 2025Updated 10 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆53Mar 11, 2025Updated 11 months ago
- A forward proxy to turn network traffic into personal memory for AI agents☆36Updated this week
- Python language chat with Ollama models locally, anthropic and openai☆24Apr 13, 2025Updated 10 months ago
- ☆31Jun 29, 2025Updated 8 months ago
- Web application for roleplaying with AI-powered characters☆68Jul 8, 2025Updated 7 months ago
- Quantized text-audio foundation model from Boson AI☆43Aug 13, 2025Updated 6 months ago
- ☆25Apr 26, 2025Updated 10 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆288Apr 14, 2025Updated 10 months ago
- Konda - The simplest way to use Conda environments on Google Colab.☆39Jan 16, 2026Updated last month
- xllamacpp - a Python wrapper of llama.cpp☆75Updated this week
- The DPAB-α Benchmark☆32Jan 15, 2025Updated last year
- ☆27Jun 11, 2025Updated 8 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- Genertaes control vectors for use with llama.cpp in GGUF format.☆38Mar 19, 2025Updated 11 months ago
- Orpheus Chat WebUI☆76Mar 27, 2025Updated 11 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆35Nov 20, 2025Updated 3 months ago
- Embed anything.☆27May 24, 2024Updated last year
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆212May 9, 2025Updated 9 months ago