madroidmaq / mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.
115Updated this week

Alternatives and similar repositories for mlx-omni-server:

Users that are interested in mlx-omni-server are comparing it to the libraries listed below