This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx
☆27 · Mar 12, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for mlx-LLM-cheatsheet
Users that are interested in mlx-LLM-cheatsheet are comparing it to the libraries listed below.
- grep for context, not just text. Local-first CLI for searching documents, notes, memories, and project context. ☆23 · Mar 8, 2026 · Updated 2 weeks ago
- experiments with MLX ☆68 · Dec 15, 2025 · Updated 3 months ago
- Introduction to MLX for Swift developers ☆46 · Jun 23, 2025 · Updated 9 months ago
- ☆21 · Oct 9, 2024 · Updated last year
- ☆48 · Jan 3, 2026 · Updated 2 months ago
- Symphony - A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi… ☆32 · Oct 30, 2025 · Updated 4 months ago
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX ☆49 · Mar 16, 2026 · Updated last week
- Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve etc.) ☆135 · Feb 11, 2026 · Updated last month
- ☆19 · Jul 31, 2025 · Updated 7 months ago
- This repo is for the LinkedIn Learning course: Generative AI and LLMOps: Deploying & Managing LLMs in Production ☆11 · Aug 12, 2024 · Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others ☆48 · Oct 1, 2023 · Updated 2 years ago
- Fast parallel LLM inference for MLX ☆249 · Jul 7, 2024 · Updated last year
- On-device semantic search over Apple WWDC 2025 docs using MLX embeddings - SwiftUI app (WWDC OMT 2025) ☆76 · Jun 12, 2025 · Updated 9 months ago
- Realtime Transcription with Voxtral in MLX ☆90 · Feb 8, 2026 · Updated last month
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor… ☆133 · Feb 27, 2026 · Updated 3 weeks ago
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon ☆31 · Jun 18, 2025 · Updated 9 months ago
- 📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual b… ☆44 · Mar 16, 2026 · Updated last week
- BH hackathon ☆14 · Apr 4, 2024 · Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. ☆43 · Jun 20, 2025 · Updated 9 months ago
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced … ☆27 · Jan 19, 2025 · Updated last year
- FastMLX is a high performance production ready API to host MLX models. ☆347 · Mar 18, 2025 · Updated last year
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ☆688 · Mar 10, 2026 · Updated 2 weeks ago
- ☆61 · Aug 1, 2025 · Updated 7 months ago
- Custom nodes for use with Vuo. ☆13 · Dec 18, 2018 · Updated 7 years ago
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op… ☆291 · Jan 24, 2026 · Updated 2 months ago
- Swift implementation of Flux.1 using mlx-swift ☆117 · Aug 10, 2025 · Updated 7 months ago
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 · Nov 28, 2024 · Updated last year
- Implementation of Visual Intelligence Using SmolVLM 2 by Hugging Face ☆39 · Jan 15, 2026 · Updated 2 months ago
- Abelian sandpiles ☆16 · Nov 16, 2024 · Updated last year
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX. ☆102 · Jun 29, 2025 · Updated 8 months ago
- MLX-GUI MLX Inference Server for Apple Silicon ☆202 · Jan 13, 2026 · Updated 2 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆93 · Jan 23, 2026 · Updated 2 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 10 months ago
- ☆19 · Dec 31, 2025 · Updated 2 months ago
- An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX. ☆397 · Aug 15, 2025 · Updated 7 months ago
- Converting CR10SPro to Voron Switchwire ☆16 · Jul 10, 2023 · Updated 2 years ago
- Utility for generating html elements with tagged `template literal`. Only 649 bytes. ☆12 · Sep 25, 2024 · Updated last year
- 👀 HOC for creating aware components in ReactVR ☆13 · Oct 13, 2017 · Updated 8 years ago
- MiniLM (BERT) embeddings from scratch ☆20 · Aug 14, 2025 · Updated 7 months ago