FastMLX is a high performance production ready API to host MLX models.
☆358Mar 18, 2025Updated last year
Alternatives and similar repositories for fastmlx
Users that are interested in fastmlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆394May 13, 2026Updated last month
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆5,008Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆724May 9, 2026Updated last month
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆285Jun 16, 2025Updated 11 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆135Feb 27, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- On-device Image Generation for Apple Silicon☆703Apr 11, 2025Updated last year
- MLX native implementations of state-of-the-art generative image models☆2,126Jun 7, 2026Updated last week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆103Jun 29, 2025Updated 11 months ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆77Mar 23, 2026Updated 2 months ago
- Start a server from the MLX library.☆199Jul 26, 2024Updated last year
- Gradio chat interface for FastMLX☆12Sep 22, 2024Updated last year
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.☆831Mar 12, 2025Updated last year
- For inferring and serving local LLMs using the MLX framework☆114Mar 24, 2024Updated 2 years ago
- The easiest way to run the fastest MLX-based LLMs locally☆327Oct 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scripts to create your own moe models using mlx☆90Feb 26, 2024Updated 2 years ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Dec 12, 2025Updated 6 months ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆148May 20, 2026Updated 3 weeks ago
- ☆21Oct 9, 2024Updated last year
- Fast parallel LLM inference for MLX☆249Jul 7, 2024Updated last year
- MLX-GUI MLX Inference Server for Apple Silicone☆210Apr 1, 2026Updated 2 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 9 months ago
- Implementation of nougat that focuses on processing pdf locally.☆85Jan 15, 2025Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Jun 20, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of F5-TTS in MLX☆634Mar 19, 2025Updated last year
- mlx image models for Apple Silicon machines☆97Apr 8, 2026Updated 2 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆64Apr 14, 2024Updated 2 years ago
- MLX version of DINO DETR☆16Dec 26, 2024Updated last year
- ☆227Jun 3, 2026Updated last week
- Train Large Language Models on MLX.☆377Updated this week
- A repo of useful MLX skills.☆85Jan 25, 2026Updated 4 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆939May 8, 2024Updated 2 years ago
- Examples in the MLX framework☆8,717Apr 6, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆463Jan 29, 2025Updated last year
- An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.☆1,595Sep 6, 2024Updated last year
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆124Nov 10, 2024Updated last year
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆276Nov 9, 2025Updated 7 months ago
- A collection of optimizers for MLX☆57Dec 12, 2025Updated 6 months ago
- MLX Image Models☆24Mar 14, 2024Updated 2 years ago
- ☆77Nov 22, 2024Updated last year