Super simple Python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!
☆29 · Dec 11, 2025 · Updated 2 months ago
Alternatives and similar repositories for llama-cpp-connector
Users who are interested in llama-cpp-connector are comparing it to the libraries listed below.
- ☆15 · Apr 9, 2025 · Updated 10 months ago
- ☆13 · Mar 10, 2025 · Updated 11 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App ☆23 · Apr 1, 2025 · Updated 11 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp ☆16 · Feb 10, 2026 · Updated 2 weeks ago
- Quick access to any large language model from your browser. ☆10 · Feb 16, 2026 · Updated last week
- Unified API platform for free access to enterprise-grade AI models from Google, Groq, and OpenRouter. Industrial-ready integration with h… ☆13 · Mar 14, 2025 · Updated 11 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa… ☆11 · Oct 28, 2024 · Updated last year
- Moondream MCP Server in Python ☆44 · Jul 2, 2025 · Updated 7 months ago
- Open source Speechify alternative. Read PDFs and EPUBs with local models. ☆37 · Nov 14, 2025 · Updated 3 months ago
- A simple CLI app which allows you to generate and deploy simple apps. MVP. ☆21 · Aug 4, 2025 · Updated 6 months ago
- Simple node proxy for llama-server that enables MCP use ☆17 · May 10, 2025 · Updated 9 months ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi… ☆48 · Feb 27, 2025 · Updated last year
- A browser GUI for nvidia-smi ☆20 · Mar 17, 2025 · Updated 11 months ago
- Chatbot-to-speech using Orpheus TTS model. Interactive console app. ☆21 · May 1, 2025 · Updated 10 months ago
- ☆17 · Dec 16, 2024 · Updated last year
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models: no cloud, totally local. ☆17 · May 21, 2025 · Updated 9 months ago
- A forward proxy to turn network traffic into personal memory for AI agents ☆36 · Feb 23, 2026 · Updated last week
- Python language chat with Ollama models locally, Anthropic and OpenAI ☆24 · Apr 13, 2025 · Updated 10 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models ☆20 · Apr 15, 2025 · Updated 10 months ago
- Web application for roleplaying with AI-powered characters ☆68 · Jul 8, 2025 · Updated 7 months ago
- ☆25 · Apr 26, 2025 · Updated 10 months ago
- Quantized text-audio foundation model from Boson AI ☆43 · Aug 13, 2025 · Updated 6 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts ☆30 · Apr 30, 2025 · Updated 10 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆289 · Apr 14, 2025 · Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 · Nov 26, 2025 · Updated 3 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information (JSON) from images of process diagrams ☆40 · Apr 5, 2025 · Updated 10 months ago
- Konda - The simplest way to use Conda environments on Google Colab. ☆39 · Jan 16, 2026 · Updated last month
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control. ☆30 · May 7, 2025 · Updated 9 months ago
- ☆27 · Jun 11, 2025 · Updated 8 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation ☆32 · Nov 16, 2024 · Updated last year
- Orpheus Chat WebUI ☆76 · Mar 27, 2025 · Updated 11 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! ☆35 · Nov 20, 2025 · Updated 3 months ago
- Embed anything. ☆27 · May 24, 2024 · Updated last year
- A zero-dependency prompt manager/catalog/library in a single HTML file. Everything is stored locally in your browser. Meow. 😼 ☆64 · Aug 14, 2025 · Updated 6 months ago
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks. ☆47 · Aug 13, 2025 · Updated 6 months ago
- Share sensitive information securely with a link that can only be viewed once. ☆42 · Feb 16, 2026 · Updated last week
- Easy to use interface for the Whisper model optimized for all GPUs! ☆488 · Feb 15, 2026 · Updated 2 weeks ago
- Text to audio with TikTok Voices ☆13 · Apr 6, 2023 · Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat… ☆25 · Oct 2, 2025 · Updated 5 months ago