Explore a simple example of using MLX for a RAG application running locally on your Apple Silicon device.
☆181Jan 31, 2024Updated 2 years ago
Alternatives and similar repositories for mlx-rag
Users who are interested in mlx-rag are comparing it to the libraries listed below.
- run embeddings in MLX☆98Sep 27, 2024Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Apr 27, 2024Updated 2 years ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆386May 13, 2026Updated 2 weeks ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max).☆25Dec 16, 2023Updated 2 years ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆264Oct 25, 2025Updated 7 months ago
- Run and train GPT-2 on Apple silicon☆36Feb 6, 2024Updated 2 years ago
- 🧠 Retrieval Augmented Generation (RAG) example☆19Apr 17, 2026Updated last month
- mlx image models for Apple Silicon machines☆98Apr 8, 2026Updated last month
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆134Feb 27, 2026Updated 3 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆38Jun 21, 2024Updated last year
- ☆83Mar 3, 2026Updated 2 months ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated last month
- Scripts to create your own moe models using mlx☆90Feb 26, 2024Updated 2 years ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆60Feb 9, 2024Updated 2 years ago
- Examples in the MLX framework☆8,652Apr 6, 2026Updated last month
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆285Jun 16, 2025Updated 11 months ago
- mlx implementations of various transformers, speedups, training☆33Dec 14, 2023Updated 2 years ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆76Mar 23, 2026Updated 2 months ago
- FastMLX is a high performance production ready API to host MLX models.☆357Mar 18, 2025Updated last year
- Efficient framework-agnostic data loading☆474Oct 1, 2025Updated 7 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆88Feb 11, 2024Updated 2 years ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Jun 20, 2025Updated 11 months ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆16Mar 30, 2026Updated last month
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆80Jan 28, 2024Updated 2 years ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆464Jan 29, 2025Updated last year
- Implementation of nougat that focuses on processing pdf locally.☆85Jan 15, 2025Updated last year
- ☆224May 18, 2026Updated last week
- Roberta Question Answering using MLX.☆24Feb 22, 2026Updated 3 months ago
- Start a server from the MLX library.☆199Jul 26, 2024Updated last year
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆4,779Updated this week
- Changes in this fork have been merged upstream.☆16Jun 10, 2025Updated 11 months ago
- MLX Image Models☆24Mar 14, 2024Updated 2 years ago
- ☆38Mar 12, 2024Updated 2 years ago
- Swift package for reading and writing Safetensors files.☆13Feb 6, 2026Updated 3 months ago
- On-device Image Generation for Apple Silicon☆702Apr 11, 2025Updated last year
- Distributed Inference for mlx LLm☆101Aug 1, 2024Updated last year
- Run embedding models locally in Swift using MLTensor.☆147May 17, 2026Updated last week
- For inferring and serving local LLMs using the MLX framework☆113Mar 24, 2024Updated 2 years ago