yazon / flexllama
🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GPU support
☆24Updated last week
Alternatives and similar repositories for flexllama
Users who are interested in flexllama are comparing it to the libraries listed below.
- A real-time shared memory layer for multi-agent LLM systems.☆43Updated last month
- Personal voice assistant, with voice interruption and Twilio support☆18Updated 5 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆66Updated 8 months ago
- AI debugger and AI coder integrated. Use AI to write code and drive the runtime debugger☆49Updated this week
- Python chat with local Ollama models, Anthropic, and OpenAI☆25Updated 3 months ago
- ☆152Updated this week
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆70Updated last week
- ☆109Updated this week
- ☆80Updated 5 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆53Updated 2 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆32Updated last month
- Generates breakthrough ideas from a single prompt through an 8-stage walkthrough, with an optional research proposal paper.☆56Updated 4 months ago
- ☆28Updated last month
- A unified library for interacting with various AI APIs through a standardized interface.☆31Updated 4 months ago
- ☆132Updated 3 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 6 months ago
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script)☆52Updated 9 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆25Updated 2 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆42Updated 2 months ago
- Open-source tool for transcription and subtitling, an alternative to happyscribe.☆27Updated 5 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆125Updated 9 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 5 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆76Updated last month
- A fully autonomous agent that accesses the browser and performs tasks.☆17Updated 3 months ago
- ☆17Updated 7 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆30Updated last month
- Locally hosted AI agent Python tool to generate novel research hypotheses, titles, and abstracts☆27Updated 3 months ago
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆57Updated 5 months ago
- Copy a bunch of files into your clipboard to provide context for LLMs☆109Updated last month