A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Built with Python and FastAPI, it offers an efficient, scalable, and user-friendly way to run MLX-based vision and language models locally.
☆248 Mar 3, 2026 Updated this week
Alternatives and similar repositories for mlx-openai-server
Users who are interested in mlx-openai-server compare it to the repositories listed below.
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ☆673 Dec 21, 2025 Updated 2 months ago
- Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve, etc.) ☆129 Feb 11, 2026 Updated 3 weeks ago
- High-performance MLX-based LLM inference engine for macOS with native Swift implementation ☆504 Updated this week
- MLX-GUI MLX Inference Server for Apple Silicon ☆194 Jan 13, 2026 Updated last month
- FastMLX is a high performance production ready API to host MLX models. ☆346 Mar 18, 2025 Updated 11 months ago
- JavaScript multivariate data visualization ☆14 Jan 10, 2017 Updated 9 years ago
- Clone your friends with iMessage and MLX ☆34 Jan 9, 2024 Updated 2 years ago
- Simple Tool Caller for llama.cpp ☆11 Aug 12, 2024 Updated last year
- A framework for building programmable applications ☆29 Jan 26, 2023 Updated 3 years ago
- Explore Building Computer Use Agents with Gemini 2.0 ☆19 Dec 12, 2024 Updated last year
- Train Large Language Models on MLX. ☆273 Feb 27, 2026 Updated last week
- OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous bat… ☆511 Feb 25, 2026 Updated last week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon. ☆228 Oct 28, 2025 Updated 4 months ago
- ☆15 Feb 23, 2026 Updated last week
- OpenSCAD library to improve 3D-printed vertical holes ☆14 Nov 23, 2017 Updated 8 years ago
- ☆15 Updated this week
- Implementation of ModernBERT in MLX ☆20 Jan 7, 2026 Updated 2 months ago
- CLI for Recursive Language Models ☆52 Jan 28, 2026 Updated last month
- Introduction to MLX for Swift developers ☆45 Jun 23, 2025 Updated 8 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆284 Jun 16, 2025 Updated 8 months ago
- Fast parallel LLM inference for MLX ☆247 Jul 7, 2024 Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx ☆18 Mar 15, 2025 Updated 11 months ago
- Support for Geckodriver (Firefox driver) within Appium ☆18 Feb 16, 2026 Updated 2 weeks ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆2,212 Updated this week
- A framework for orchestrating AI agents using a mermaid graph ☆76 May 16, 2024 Updated last year
- ☆52 Jan 20, 2026 Updated last month
- ☆20 Oct 25, 2025 Updated 4 months ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls. ☆39 Nov 21, 2025 Updated 3 months ago
- Sample project for F5-TTS using MLX Swift ☆50 Jan 15, 2026 Updated last month
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆285 Feb 9, 2026 Updated 3 weeks ago
- Memory Agent monorepo ☆83 Oct 9, 2025 Updated 4 months ago
- An MCP server for Raindrop.io (bookmarking service) ☆20 Apr 10, 2025 Updated 10 months ago
- FastMLX is a high performance production ready API to host MLX models. ☆25 Nov 18, 2024 Updated last year
- ☆21 Oct 9, 2024 Updated last year
- Hyperparam local dataset viewer ☆27 Updated this week
- Shared personal notes created while working with the Apple MLX machine learning framework ☆24 Dec 12, 2025 Updated 2 months ago
- A Model Context Protocol (MCP) server that provides file system context to Large Language Models (LLMs). This server enables LLMs to read… ☆35 Jul 10, 2025 Updated 7 months ago
- Qwen Image models through MPS ☆260 Dec 31, 2025 Updated 2 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX ☆29 Oct 15, 2024 Updated last year