This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.
☆16Feb 28, 2026Updated this week
Alternatives and similar repositories for mlx_gguf_server
Users that are interested in mlx_gguf_server are comparing it to the libraries listed below
Sorting:
- Gradio chat interface for FastMLX☆12Sep 22, 2024Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Feb 16, 2026Updated 2 weeks ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated 9 months ago
- Test your local LLMs on the AIME problems☆32Jun 7, 2025Updated 8 months ago
- Sample that demonstrates how to get started using Prism for Xamarin.Forms☆12Jun 8, 2015Updated 10 years ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆39Nov 21, 2025Updated 3 months ago
- Xamarin Evolve 2016 Slides and Samples☆13May 15, 2017Updated 8 years ago
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max).☆25Dec 16, 2023Updated 2 years ago
- An attendance bot that joins google meet automatically according to schedule and marks present in the google meet.☆12Sep 20, 2022Updated 3 years ago
- Many command line tools, bookmarklets and apps for productivity using the OpenAI chat completion API that models ChatGPT☆25Feb 7, 2026Updated 3 weeks ago
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆21Jan 16, 2024Updated 2 years ago
- Xamarin port of https://github.com/PaoloRotolo/AppIntro☆22Apr 10, 2022Updated 3 years ago
- A cross-platform npm package for converting `.docx` files to PDF. Supports Windows, macOS, and Linux. Includes functionality for converti…☆33Jul 18, 2025Updated 7 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated 11 months ago
- Plugin QGIS☆10Jan 16, 2023Updated 3 years ago
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly☆32Oct 19, 2024Updated last year
- List of resources helping you become a better AI engineer.☆39Jan 3, 2025Updated last year
- Automatically post images from a subreddit to an instagram account.☆10Feb 24, 2022Updated 4 years ago
- C/C++ Windows Process Injector for Educational Purposes.☆10Apr 2, 2021Updated 4 years ago
- Project-agnostic, composable configuration system for AI-assisted development workflows. Single source of truth for agentic tools (Claude…☆23Updated this week
- This is a frontend to the Inkscape command line feature to allow the user to perform batch conversions of SVG files.☆15Dec 10, 2013Updated 12 years ago
- Community developed SCOM Management Pack for VMware☆14Apr 28, 2021Updated 4 years ago
- This project is the backend engine for a fully autonomous AI-powered call center. It integrates a large language model (LLM), speech reco…☆21Apr 18, 2025Updated 10 months ago
- Bugtracker of novel-ebook.com☆12Aug 11, 2021Updated 4 years ago
- A production-ready SaaS UI theme for Astro. Designed to help you move from idea to launch quickly.☆32Dec 28, 2025Updated 2 months ago
- C++ Code☆11Aug 13, 2019Updated 6 years ago
- How to run a local server on LM Studio☆36Apr 26, 2024Updated last year
- websocket-protocol's implementation with multithread synchronization model in C++☆17Jul 23, 2017Updated 8 years ago
- Template repository for SillyTavern extensions using React and Webpack.☆15Updated this week
- separating music and voice from a song☆10Nov 29, 2018Updated 7 years ago
- Emotion based music recommender system☆11Mar 26, 2025Updated 11 months ago
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t…☆10Aug 19, 2018Updated 7 years ago
- Collected Latin files from the Perseus Digital Library☆13Jun 21, 2017Updated 8 years ago
- OData Browser for the iPhone☆26Aug 7, 2010Updated 15 years ago
- ☆13Oct 4, 2024Updated last year
- netease python2 inject hook☆13Jan 8, 2025Updated last year
- Build WSA Kernel with Docker☆17Oct 26, 2021Updated 4 years ago
- ☆11Sep 4, 2020Updated 5 years ago