gitkaz / mlx_gguf_serverLinks
This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.
☆14Updated last week
Alternatives and similar repositories for mlx_gguf_server
Users that are interested in mlx_gguf_server are comparing it to the libraries listed below
Sorting:
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated 2 weeks ago
- AI Search engine☆12Updated 2 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆30Updated 5 months ago
- Snag web pages like a polite robot with a browser☆18Updated last week
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆28Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16Updated 7 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆31Updated 8 months ago
- Gradio chat interface for FastMLX☆12Updated last year
- Condensing codebases to a single file for usage in long context LLMs (Gemini 1.5 Pro, GPT-4-Turbo, Claude Opus)☆13Updated last year
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- Simple GUI to load a PDF/Docx/txt file and have LM Studio Answer based off of it.☆14Updated last year
- ☆11Updated last year
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic.☆38Updated 11 months ago
- ☆14Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆15Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated last month
- AI Agent capable of automating various tasks using MCP☆40Updated 8 months ago
- 🧠 Retrieval Augmented Generation (RAG) example☆18Updated 4 months ago
- Your Python AI Coder!☆35Updated 7 months ago
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆48Updated last month
- Example Optimizely clone created with GPT Pilot☆29Updated last year
- ☆32Updated last year
- Galleries for Models, Datasets, and Plugins used by Transformer Lab☆27Updated this week
- Ready-to-use agent that can interact directly with any tool or native endpoint, in less than 5 lines of code☆42Updated 2 months ago
- A simple Flask app that lets you text back and forth with Open Interpreter. Probably a bad idea.☆22Updated 2 years ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆11Updated last year
- A QT GUI for large language models☆38Updated last year