gitkaz / mlx_gguf_serverLinks
This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.
☆16Updated last month
Alternatives and similar repositories for mlx_gguf_server
Users that are interested in mlx_gguf_server are comparing it to the libraries listed below
Sorting:
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated last week
- Gradio chat interface for FastMLX☆12Updated last year
- AI Agent capable of automating various tasks using MCP☆40Updated 10 months ago
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated 3 weeks ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆38Updated 2 months ago
- Snag web pages like a polite robot with a browser☆25Updated this week
- AI Search engine☆13Updated 4 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆31Updated 6 months ago
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16Updated 9 months ago
- A set of tools to create synthetically-generated data from documents☆39Updated 5 months ago
- watch your screen while doing sales and fill your crm automatically☆17Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 3 months ago
- A QT GUI for large language models☆39Updated 2 years ago
- Python client for txtai☆14Updated 2 weeks ago
- Simple agent framework using Ollama tool calling☆10Updated last year
- Simple GUI to load a PDF/Docx/txt file and have LM Studio Answer based off of it.☆14Updated last year
- Example Optimizely clone created with GPT Pilot☆30Updated last year
- Web Interface for Vision Language Models Including InternVLM2☆25Updated last year
- A simple Flask app that lets you text back and forth with Open Interpreter. Probably a bad idea.☆22Updated 2 years ago
- Run GEPA on your favorite non-python libraries.☆32Updated 2 weeks ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- an auto coder which automatically fixes errors and improves the code from simple user prompt☆37Updated last year
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆29Updated 2 years ago
- 🧠 Mem4AI: A LLM Friendly memory management library.☆35Updated last year
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆30Updated 10 months ago
- Call another MCP client from your MCP client. Offload context windows, delegate tasks, split between models☆30Updated 11 months ago
- A python script to loop through urls in a csv and look for specific keywords on the scraped homepage.☆16Updated 3 years ago
- Codebase exploration with AI research agents☆19Updated 11 months ago
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate h…☆22Updated 2 months ago