A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.
β274Mar 24, 2026Updated this week
Alternatives and similar repositories for mlx-openai-server
Users that are interested in mlx-openai-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Final Project for OOP Course - University of Science, VNUHCMβ10Feb 13, 2023Updated 3 years ago
- My personal blog about AI, ML and DL πβ11Aug 23, 2023Updated 2 years ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. Iβ¦β688Mar 10, 2026Updated 2 weeks ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)β135Feb 11, 2026Updated last month
- Runner-up team (2nd place) in AI4VN2022: Air Quality Forcasting Challengeβ31Jul 12, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β23Aug 1, 2025Updated 7 months ago
- OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batβ¦β665Updated this week
- FastMLX is a high performance production ready API to host MLX models.β347Mar 18, 2025Updated last year
- High-performance MLX-based LLM inference engine for macOS with native Swift implementationβ527Mar 10, 2026Updated 2 weeks ago
- β43Jun 27, 2025Updated 8 months ago
- MLX-GUI MLX Inference Server for Apple Siliconeβ202Jan 13, 2026Updated 2 months ago
- Train Large Language Models on MLX.β286Mar 11, 2026Updated 2 weeks ago
- Run LLMs with MLXβ4,103Mar 20, 2026Updated last week
- Llambada: Simple Text Controllable for accompaniment generationβ41Mar 14, 2026Updated last week
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.β2,338Updated this week
- Fast parallel LLM inference for MLXβ249Jul 7, 2024Updated last year
- Instant Perfect Native MacOS Transcriptionβ53Jul 26, 2025Updated 8 months ago
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.β82Nov 11, 2025Updated 4 months ago
- Clone your friends with iMessage and MLXβ34Jan 9, 2024Updated 2 years ago
- A framework for building programmable applicationsβ29Jan 26, 2023Updated 3 years ago
- Simple Tool Caller for llama.cppβ11Aug 12, 2024Updated last year
- Zed extension for Exa's MCP serverβ22Mar 11, 2026Updated 2 weeks ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUFβ31Jan 23, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.β232Oct 28, 2025Updated 4 months ago
- Sample project for F5-TTS using MLX Swiftβ50Jan 15, 2026Updated 2 months ago
- Find the hidden meaning of LLMsβ40Nov 13, 2025Updated 4 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.β286Jun 16, 2025Updated 9 months ago
- Audio transcription using mlx whisper and vad silence processingβ17Oct 14, 2024Updated last year
- Qwen Image models through MPSβ263Dec 31, 2025Updated 2 months ago
- β15Feb 23, 2026Updated last month
- javascript multivariate data visualizationβ14Jan 10, 2017Updated 9 years ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For morβ¦β133Feb 27, 2026Updated 3 weeks ago
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Run GreenBitAI's Quantized LLMs on Apple Devices with MLXβ31Aug 27, 2025Updated 7 months ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignmentβ68Dec 5, 2022Updated 3 years ago
- β26Dec 11, 2025Updated 3 months ago
- β39Aug 4, 2025Updated 7 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.β317Mar 14, 2026Updated last week
- This is a project that translates a .pdf file, preserving the original layout of that .pdf file. [UPDATED] We have achieved the Second Prβ¦β111Nov 15, 2024Updated last year
- utility to create xast treesβ13Jul 31, 2023Updated 2 years ago