cubist38 / mlx-openai-serverLinks

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

☆110

Alternatives and similar repositories for mlx-openai-server

Users that are interested in mlx-openai-server are comparing it to the libraries listed below

Sorting:

arcee-ai / fastmlx
FastMLX is a high performance production ready API to host MLX models.
☆331Updated 6 months ago
Goekdeniz-Guelmez / mlx-lm-lora
Train Large Language Models on MLX.
☆183Updated last week
Blaizzy / mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
☆210Updated last month
madroidmaq / mlx-omni-server
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…
☆573Updated last month
mzau / mlx-knife
ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)
☆103Updated last week
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆95Updated 3 months ago
RamboRogers / mlx-gui
MLX-GUI MLX Inference Server for Apple Silicone
☆124Updated last month
armbues / SiLLM
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
☆280Updated 3 months ago
ivanfioravanti / lmstudio_hf
A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.
☆63Updated 7 months ago
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆146Updated last week
ivanfioravanti / qwen-image-mps
Qwen Image models through MPS
☆212Updated 2 weeks ago
JosefAlbers / Phi-3-Vision-MLX
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
☆272Updated last year
Goekdeniz-Guelmez / mlx-lm-lens
Find the hidden meaning of LLMs
☆27Updated 2 months ago
transformerlab / transformerlab-api
API Server for Transformer Lab
☆79Updated this week
mustafaaljadery / mlxserver
Start a server from the MLX library.
☆192Updated last year
iluxu / llmbasedos
llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work
☆279Updated last month
qnguyen3 / docpixie
Lightweight Vision native Multimodal Document Agent
☆119Updated last month
mzbac / mlx_sharding
Distributed Inference for mlx LLm
☆96Updated last year
mzbac / mlx-llm-server
For inferring and serving local LLMs using the MLX framework
☆109Updated last year
mark-lord / MLX-text-completion-notebook
A simple Jupyter Notebook for learning MLX text-completion fine-tuning!
☆122Updated 10 months ago
lmstudio-ai / mlx-engine
LM Studio Apple MLX engine
☆790Updated last week
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆220Updated last year
Trans-N-ai / swama
High-performance MLX-based LLM inference engine for macOS with native Swift implementation
☆412Updated last week
JosefAlbers / whisper-turbo-mlx
Blazing fast whisper turbo for ASR (speech-to-text) tasks
☆217Updated 11 months ago
codelion / ellora
Enhancing LLMs with LoRA
☆159Updated 3 weeks ago
PicoMLX / PicoMLXServer
The easiest way to run the fastest MLX-based LLMs locally
☆302Updated 11 months ago
codelion / adaptive-classifier
A flexible, adaptive classification system for dynamic text classification
☆463Updated 2 weeks ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆259Updated 7 months ago
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆126Updated 9 months ago
quelmap-inc / quelmap
Open Source Local Data Analysis Assistant.
☆41Updated this week