A high-performance API server that exposes OpenAI-compatible endpoints for MLX models. Written in Python on the FastAPI framework, it runs MLX-based vision and language models locally behind an OpenAI-compatible interface.
☆337 · May 18, 2026 · Updated last week
Alternatives and similar repositories for mlx-openai-server
Users that are interested in mlx-openai-server are comparing it to the libraries listed below.
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ☆716 · May 9, 2026 · Updated 2 weeks ago
- Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve, etc.) ☆147 · Updated this week
- ☆23 · Aug 1, 2025 · Updated 9 months ago
- FastMLX is a high-performance, production-ready API to host MLX models. ☆357 · Mar 18, 2025 · Updated last year
- Minimal Claude Code alternative powered by MLX ☆46 · Jan 11, 2026 · Updated 4 months ago
- OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous bat… ☆1,223 · May 17, 2026 · Updated last week
- ☆44 · Jun 27, 2025 · Updated 10 months ago
- MLX-GUI: MLX inference server for Apple Silicon ☆210 · Apr 1, 2026 · Updated last month
- Real-time webcam demo with SmolVLM (mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM ☆26 · Jun 12, 2025 · Updated 11 months ago
- Train Large Language Models on MLX. ☆372 · May 8, 2026 · Updated 2 weeks ago
- Run LLMs with MLX ☆5,387 · May 19, 2026 · Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆4,779 · Updated this week
- ☆35 · Feb 14, 2026 · Updated 3 months ago
- Fast parallel LLM inference for MLX ☆249 · Jul 7, 2024 · Updated last year
- ☆20 · Oct 25, 2025 · Updated 7 months ago
- Instant Perfect Native MacOS Transcription ☆54 · Jul 26, 2025 · Updated 10 months ago
- Clone your friends with iMessage and MLX ☆35 · Jan 9, 2024 · Updated 2 years ago
- dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs. ☆127 · Mar 3, 2026 · Updated 2 months ago
- Sample project for F5-TTS using MLX Swift ☆54 · Jan 15, 2026 · Updated 4 months ago
- Find the hidden meaning of LLMs ☆41 · Nov 13, 2025 · Updated 6 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆242 · Oct 28, 2025 · Updated 6 months ago
- Fastest way to scaffold FastHTML applications. ☆37 · Sep 13, 2025 · Updated 8 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆285 · Jun 16, 2025 · Updated 11 months ago
- Forked from ggerganov/llama.cpp ☆17 · Updated this week
- Qwen Image models through MPS ☆267 · Dec 31, 2025 · Updated 4 months ago
- ☆15 · Feb 23, 2026 · Updated 3 months ago
- Complete automated setup guide for Qwen3-Coder-480B-A35B-Instruct model installation on Ubuntu with NVIDIA GPUs ☆44 · Aug 3, 2025 · Updated 9 months ago
- Zed extension for Exa's MCP server ☆25 · Mar 11, 2026 · Updated 2 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor… ☆135 · Feb 27, 2026 · Updated 2 months ago
- Curated list of mental gems and streams to fuel your cognition ☆13 · Mar 1, 2026 · Updated 2 months ago
- JavaScript multivariate data visualization ☆14 · Jan 10, 2017 · Updated 9 years ago
- ☆14 · May 26, 2025 · Updated last year
- Run GreenBitAI's Quantized LLMs on Apple Devices with MLX ☆31 · Aug 27, 2025 · Updated 8 months ago
- OpenSCAD library to improve 3D-printed vertical holes ☆14 · Nov 23, 2017 · Updated 8 years ago
- Implementation of ModernBERT in MLX ☆21 · Jan 7, 2026 · Updated 4 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks ☆78 · Oct 11, 2025 · Updated 7 months ago
- ☆30 · May 11, 2026 · Updated 2 weeks ago
- CIFAR-10 speedrun: Trains to 94% accuracy in 1.98 seconds on a single NVIDIA A100 GPU. ☆76 · Oct 17, 2025 · Updated 7 months ago
- ComfyUI for Audio ☆42 · Sep 21, 2025 · Updated 8 months ago