gitkaz/mlx_gguf_server

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gitkaz/mlx_gguf_server)

gitkaz / mlx_gguf_server

This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.

☆17

Alternatives and similar repositories for mlx_gguf_server

Users that are interested in mlx_gguf_server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bcExpt1123 / chatgpt-example
View on GitHub
ChatGPT Example - How to integrate ChatGPT into web platform
☆10Feb 18, 2025Updated last year
PikoCanFly / django-daisy-seed
View on GitHub
A minimal Django starter template with Tailwind CSS and DaisyUI components, featuring a light/dark theme toggle and a custom user model. …
☆20Jul 20, 2025Updated 10 months ago
jeyabbalas / tabnet
View on GitHub
A TensorFlow 2 Keras implementation of TabNets.
☆13Feb 21, 2022Updated 4 years ago
michielst / auto-poster
View on GitHub
Automatically post images from a subreddit to an instagram account.
☆10Feb 24, 2022Updated 4 years ago
Telecommunication-Telemedia-Assessment / AVrateVoyager
View on GitHub
an online variant of AVrateNG
☆16Mar 20, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
duoduo70 / Neoforged-Events-List-Chinese
View on GitHub
Neoforged 事件列表的中文翻译
☆19Feb 6, 2025Updated last year
heyjaywilson / vue-nav
View on GitHub
Vuejs tutorial
☆14Apr 16, 2018Updated 8 years ago
Belluxx / LocalAIME
View on GitHub
Test your local LLMs on the AIME problems
☆39Jun 7, 2025Updated 11 months ago
LinuxDroidMaster / termux-nf
View on GitHub
A better way to install NerdFonts on Termux
☆22Mar 30, 2024Updated 2 years ago
Bonney / SwiftUI-BackgroundCard
View on GitHub
A simple SwiftUI ViewModifier to add a card-like background to a View.
☆11Oct 28, 2019Updated 6 years ago
stsfaroz / Graph-Convolutional-Networks-with-Cora-Dataset
View on GitHub
Graph neural networks (GNNs) is implemented with Cora datset using Spektral module.
☆13Nov 12, 2020Updated 5 years ago
ruteee / DetectionCausalRelationshipsTimeSeries
View on GitHub
Code for paper "A method for detecting causal relationships between industrial alarm variables using Transfer entropy and K2-Algorithm"
☆17Aug 3, 2022Updated 3 years ago
klikz-dev / LLM-VoIP-Caller
View on GitHub
This project is the backend engine for a fully autonomous AI-powered call center. It integrates a large language model (LLM), speech reco…
☆21Apr 18, 2025Updated last year
XiaoMengXinX / transition-ticket
View on GitHub
☆29Jul 4, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
winter1203 / vllm_GOT2_OCR
View on GitHub
Accelerating GOT-OCRv2 with VLLM
☆10Nov 15, 2024Updated last year
huseinzol05 / transformers-continuous-batching
View on GitHub
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆29Mar 15, 2025Updated last year
abnerjacobsen / fastapi-mvc-loguru-demo
View on GitHub
Demo app with Loguru logging, async middleware to generate X-request-Id. Works with Gunicorn or Uvicorn, and is safe to use with async/th…
☆10Feb 2, 2022Updated 4 years ago
AlbertoLourenco / CardMenu
View on GitHub
An animated card menu built with SwiftUI
☆10Apr 17, 2023Updated 3 years ago
Vonage-Community / tutorial-voice-messages-node-openai-integration
View on GitHub
Integrate Phone Calls and SMS with Open AI using the Vonage Voice and Messages APIs
☆11Apr 18, 2026Updated last month
JethroWangSir / SincQDR-VAD
View on GitHub
☆25Aug 29, 2025Updated 8 months ago
Arun70 / Separation-of-music-and-voice-using-matlab
View on GitHub
separating music and voice from a song
☆10Nov 29, 2018Updated 7 years ago
maxkura / Ask_Why
View on GitHub
A codex planning mode enhancing skill，works better than codex planning mode.
☆111Apr 25, 2026Updated 3 weeks ago
QBobWatson / python-ebml
View on GitHub
Pure python Matroska / EBML parser
☆16Jun 30, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lbjlaq / CursorChecker
View on GitHub
☆25May 26, 2025Updated 11 months ago
Nouf-Alabbasi / oKUmura_AI_Telecom_challenge
View on GitHub
☆13Jul 29, 2025Updated 9 months ago
lhermann / astro-firebase-example
View on GitHub
☆12Sep 5, 2021Updated 4 years ago
typoverflow / pytorch-crf
View on GitHub
条件随机场（CRF）的pytorch实现
☆10Mar 7, 2021Updated 5 years ago
slava13 / StockPriceMonitor
View on GitHub
Application that allows you to monitor your stock price depending on the amount of RSU that you have from this company. App goes to check…
☆11Jul 15, 2021Updated 4 years ago
lex8erna / UPGMApy
View on GitHub
A basic implementation of the UPGMA (Unweighted Pair Group Method with Arithmetic Mean) clustering algorithm in Python.
☆13Dec 11, 2015Updated 10 years ago
TomEverson / Astro-Firebase-Starter
View on GitHub
Astro SSG🚀 + Firebase🔥
☆11Jan 10, 2024Updated 2 years ago
asaddi / f5-tts-serve
View on GitHub
A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…
☆14Feb 7, 2025Updated last year
Matt-T-Git / ChatGPTCodeGen
View on GitHub
iOS App Demo To Generate swift code using OpenAI & ChatGPT
☆13Dec 19, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kurtvandusen / React-Native-Easy-Chatbot
View on GitHub
A chatbot demo app using Huggingface inference API, React Native, Expo, and Redux Saga. Build for Android, iOS, and Web.
☆13Dec 7, 2022Updated 3 years ago
Neilblaze / URL-Dinogame
View on GitHub
Google's Dinosaur game implementation in URL bar 🦕
☆50May 3, 2026Updated 2 weeks ago
LiteLDev / LiteLoader.NET
View on GitHub
LiteLoaderBDSv2 bindings for .NET (Obsoleted)
☆25Oct 13, 2023Updated 2 years ago
sorahjy / chinese_fuzzy_matching
View on GitHub
100行解决中文模糊实体识别with字典树和编辑距离 Chinese fuzzy entity matching with prefix tree and distance editing
☆11Sep 25, 2023Updated 2 years ago
jontonsoup4 / suno-clone
View on GitHub
A clone of the Suno AI website UI using NextJS and Tailwind
☆14May 27, 2024Updated last year
h9-tec / Qwen3_chat_local
View on GitHub
☆10Apr 30, 2025Updated last year
AmeddahAchraf / musicPlayerSwiftUI
View on GitHub
A SwiftUI-based Music Player inspired by the popular website Suno AI, showcasing modern iOS app development techniques. This was just a p…
☆15Nov 16, 2024Updated last year