yazon / flexllama
FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GPU support
★45 · Updated 3 weeks ago
Alternatives and similar repositories for flexllama
Users who are interested in flexllama are comparing it to the repositories listed below.
- ★176 · Updated 4 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model… ★59 · Updated 2 months ago
- ★200 · Updated 3 months ago
- A real-time shared memory layer for multi-agent LLM systems. ★50 · Updated 5 months ago
- The easiest & fastest way to run LLMs in your home lab ★72 · Updated 2 weeks ago
- Python language chat with Ollama models locally, anthropic and openai ★24 · Updated 8 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ★67 · Updated last year
- ACE-Step: A Step Towards Music Generation Foundation Model ★46 · Updated 7 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search … ★47 · Updated 3 months ago
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends ★49 · Updated 4 months ago
- Personal voice assistant, with voice interruption and Twilio support ★18 · Updated 9 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs. ★87 · Updated this week
- Generates breakthrough ideas from a single prompt through an 8 stage walkthrough, with optional research proposal paper. ★58 · Updated 2 months ago
- A web application that converts speech to speech 100% private ★81 · Updated 6 months ago
- Open source tool for transcription and subtitling, alternative to happyscribe. ★31 · Updated 10 months ago
- the AI IDE for work, research, development, and play. ★220 · Updated this week
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web … ★90 · Updated last week
- ★83 · Updated 9 months ago
- A persistent local memory for AI, LLMs, or Copilot in VS Code. ★181 · Updated last month
- Adaptive Modular Network (AMN) a potentially novel machine learning architecture capable of producing models which can learn at inference… ★54 · Updated 9 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run! ★29 · Updated last week
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the … ★58 · Updated 9 months ago
- Integrates AI tools into Microsoft Word ★156 · Updated last year
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you… ★237 · Updated 2 months ago
- ★228 · Updated 7 months ago
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-… ★38 · Updated 5 months ago
- ★58 · Updated 10 months ago
- ★19 · Updated 5 months ago
- Orpheus Chat WebUI ★74 · Updated 8 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files ★164 · Updated 10 months ago