Jaykef/mlx-rag-gguf

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jaykef/mlx-rag-gguf)

Jaykef / mlx-rag-gguf

Minimal, clean code implementation of RAG with mlx using gguf model weights

☆52

Alternatives and similar repositories for mlx-rag-gguf

Users that are interested in mlx-rag-gguf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AbeEstrada / mlx-rag
View on GitHub
🧠 Retrieval Augmented Generation (RAG) example
☆19Apr 17, 2026Updated 3 months ago
vegaluisjose / mlx-rag
View on GitHub
Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.
☆180Jan 31, 2024Updated 2 years ago
riccardomusmeci / mlx-image
View on GitHub
mlx image models for Apple Silicon machines
☆100Apr 8, 2026Updated 3 months ago
enochyearn / MLX_RoBERTa
View on GitHub
Roberta Question Answering using MLX.
☆24Feb 22, 2026Updated 5 months ago
DAMO-NLP-SG / Multipurpose-Chatbot
View on GitHub
A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)
☆20Apr 18, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
antranapp / awesome-mlx
View on GitHub
☆235Updated this week
armbues / SiLLM
View on GitHub
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
☆283Jun 16, 2025Updated last year
Jaykef / min-patchnizer
View on GitHub
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…
☆11May 16, 2024Updated 2 years ago
robertmccraith / mimm
View on GitHub
MLX Image Models
☆24Mar 14, 2024Updated 2 years ago
Goekdeniz-Guelmez / MLX-Benchmark
View on GitHub
The best benchmark for LLMs on Apple's MLX framework knowledge and coding tasks.
☆36Jun 12, 2026Updated last month
ToluClassics / mlx-transformers
View on GitHub
MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…
☆78Updated this week
taylorai / mlx_embedding_models
View on GitHub
run embeddings in MLX
☆97Sep 27, 2024Updated last year
da-z / mlx-ui
View on GitHub
A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.
☆262Oct 25, 2025Updated 9 months ago
aidenaistar / local-gpt-author
View on GitHub
☆17May 8, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
GusLovesMath / Local_LLM_Training_Apple_Silicon
View on GitHub
Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tu…
☆28May 29, 2024Updated 2 years ago
The-Swarm-Corporation / AgentParse
View on GitHub
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18Oct 13, 2025Updated 9 months ago
mlx-chat / mlx-chat-app
View on GitHub
Chat with MLX is a high-performance macOS application that connects your local documents to a personalized large language model (LLM).
☆178Mar 8, 2024Updated 2 years ago
Doriandarko / mlx-local-server
View on GitHub
A tiny server to run local inference on MLX model in the style of OpenAI
☆13Jan 31, 2024Updated 2 years ago
TroyDoesAI / AI_Research
View on GitHub
My Gen AI research
☆11Jun 3, 2024Updated 2 years ago
zhuzilin / faster-nougat
View on GitHub
Implementation of nougat that focuses on processing pdf locally.
☆85Jan 15, 2025Updated last year
mustafaaljadery / mlxserver
View on GitHub
Start a server from the MLX library.
☆198Jul 26, 2024Updated 2 years ago
ferologics / Piwork
View on GitHub
Work with Pi
☆15Feb 9, 2026Updated 5 months ago
rounak / MLXTinyGPT
View on GitHub
MLX Swift implementation of Andrej Karpathy's Let's build GPT video
☆64Apr 14, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
PicoMLX / PicoMLXServer
View on GitHub
The easiest way to run the fastest MLX-based LLMs locally
☆327Oct 30, 2024Updated last year
kunal732 / MLX-Model-Manager
View on GitHub
MLX Model Manager unifies loading and inferencing with LLMs and VLMs.
☆102Jan 30, 2025Updated last year
mzbac / mlx-lora
View on GitHub
☆38Mar 12, 2024Updated 2 years ago
arcee-ai / fastmlx
View on GitHub
FastMLX is a high performance production ready API to host MLX models.
☆363Mar 18, 2025Updated last year
N8python / n8loom
View on GitHub
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆80Feb 11, 2025Updated last year
priontific / MLX-text-completion-notebook
View on GitHub
A simple Jupyter Notebook for learning MLX text-completion fine-tuning!
☆125Nov 10, 2024Updated last year
QuixiAI / dolphin-utils
View on GitHub
☆17Feb 23, 2026Updated 5 months ago
vithursant / nanoGPT_mlx
View on GitHub
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆123Feb 12, 2024Updated 2 years ago
johnmai-dev / ChatMLX
View on GitHub
🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.
☆833Mar 12, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
riccardomusmeci / mlx-llm
View on GitHub
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.
☆465Jan 29, 2025Updated last year
OoriData / Toolio
View on GitHub
GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…
☆138Feb 27, 2026Updated 5 months ago
SinclairHudson / traccc
View on GitHub
Gradio app to track objects in video and add visual effects
☆17Jul 24, 2025Updated last year
advanc3dUA / WohnungSuchen
View on GitHub
🏠🔍 Auto check for new apartments in Hamburg from various real estate provides
☆16Apr 15, 2026Updated 3 months ago
yorkeyao / Automated-Retail-Checkout
View on GitHub
Training with Product Digital Twins for AutoRetail Checkout
☆19Aug 29, 2023Updated 2 years ago
Narsil / whispering
View on GitHub
☆20Oct 5, 2025Updated 9 months ago
preternatural-explore / mlx-swift-chat
View on GitHub
A multi-platform SwiftUI frontend for running local LLMs with Apple's MLX framework.
☆435Oct 27, 2024Updated last year