A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.
☆327May 3, 2026Updated this week
Alternatives and similar repositories for mlx-openai-server
Users that are interested in mlx-openai-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆708Mar 10, 2026Updated last month
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆145Apr 20, 2026Updated 2 weeks ago
- ☆23Aug 1, 2025Updated 9 months ago
- FastMLX is a high performance production ready API to host MLX models.☆352Mar 18, 2025Updated last year
- High-performance MLX-based LLM inference engine for macOS with native Swift implementation☆557Apr 6, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 3 months ago
- MLX-GUI MLX Inference Server for Apple Silicone☆208Apr 1, 2026Updated last month
- Train Large Language Models on MLX.☆363Apr 23, 2026Updated last week
- Real-time webcam demo with SmolVLM(mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM☆26Jun 12, 2025Updated 10 months ago
- Run LLMs with MLX☆5,089Apr 23, 2026Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆4,573Updated this week
- ☆35Feb 14, 2026Updated 2 months ago
- Fast parallel LLM inference for MLX☆249Jul 7, 2024Updated last year
- ☆20Oct 25, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Clone your friends with iMessage and MLX☆34Jan 9, 2024Updated 2 years ago
- An experimental UI for Z-Image-Base and Turbo image generation, model porting to MLX.☆27Apr 4, 2026Updated last month
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Fastest way to scaffold FastHTML applications.☆36Sep 13, 2025Updated 7 months ago
- dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.☆124Mar 3, 2026Updated 2 months ago
- Find the hidden meaning of LLMs☆41Nov 13, 2025Updated 5 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆239Oct 28, 2025Updated 6 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆286Jun 16, 2025Updated 10 months ago
- Qwen Image models through MPS☆266Dec 31, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- javascript multivariate data visualization☆14Jan 10, 2017Updated 9 years ago
- Run GreenBitAI's Quantized LLMs on Apple Devices with MLX☆31Aug 27, 2025Updated 8 months ago
- Openscad lib to improve 3D printed vertical holes☆14Nov 23, 2017Updated 8 years ago
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 3 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 6 months ago
- ☆26Dec 11, 2025Updated 4 months ago
- The ultimate training toolkit for finetuning diffusion models☆34Jan 22, 2026Updated 3 months ago
- A pyproject.toml conversion tool for Poetry to uv migration☆20Dec 28, 2024Updated last year
- ☆39Aug 4, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jan 31, 2023Updated 3 years ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆360Apr 24, 2026Updated last week
- A framework for orchestrating AI agents using a mermaid graph☆76May 16, 2024Updated last year
- An MCP server for Raindrop.io (bookmarking service)☆20Apr 10, 2025Updated last year
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆910Feb 21, 2026Updated 2 months ago
- ☆21Oct 9, 2024Updated last year
- Support for Geckodriver (Firefox driver) within Appium☆18Updated this week