This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx
☆33Mar 12, 2026Updated 3 months ago
Alternatives and similar repositories for mlx-LLM-cheatsheet
Users that are interested in mlx-LLM-cheatsheet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- experiments with MLX☆68Dec 15, 2025Updated 6 months ago
- Introduction to MLX for Swift developers☆46Jun 23, 2025Updated 11 months ago
- ☆21Oct 9, 2024Updated last year
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆148May 20, 2026Updated 3 weeks ago
- ☆19Jul 31, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 9 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Oct 1, 2023Updated 2 years ago
- Fast parallel LLM inference for MLX☆249Jul 7, 2024Updated last year
- ☆82Mar 19, 2026Updated 2 months ago
- SmolVLM2 Demo☆188Mar 20, 2025Updated last year
- Examples on how to use various LLM providers with a Wine Classification problem☆129Apr 21, 2026Updated last month
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Jul 26, 2024Updated last year
- Code for Probabilistic Sequential Matrix Factorization☆15Apr 27, 2021Updated 5 years ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆135Feb 27, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Experimenting with conversational AI in iOS, macOS and visionOS apps☆113May 23, 2026Updated 3 weeks ago
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Jun 18, 2025Updated 11 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Jun 20, 2025Updated 11 months ago
- FastMLX is a high performance production ready API to host MLX models.☆358Mar 18, 2025Updated last year
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆27Jan 19, 2025Updated last year
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆724May 9, 2026Updated last month
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op…☆306Jan 24, 2026Updated 4 months ago
- OpenSource deployment made easy☆10Jun 13, 2015Updated 11 years ago
- Fourier based drawing script☆16Mar 20, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Visual Intelligence Using SmolVLM 2 by Hugging Face☆39May 23, 2026Updated 3 weeks ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆103Jun 29, 2025Updated 11 months ago
- MLX-GUI MLX Inference Server for Apple Silicone☆210Apr 1, 2026Updated 2 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆94Jun 8, 2026Updated last week
- Attempt at Porting LTX-2 Video Model to Apple's MLX Machine Learning Framework☆114Apr 18, 2026Updated last month
- ☆19Dec 31, 2025Updated 5 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆404Aug 15, 2025Updated 10 months ago
- A simple script to enhance text editing across your Mac, leveraging the power of MLX. Designed for seamless integration, it offers real-t…☆109Mar 4, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Mar 24, 2023Updated 3 years ago
- This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Y…☆11Sep 6, 2024Updated last year
- ☆77Nov 22, 2024Updated last year
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆129Dec 27, 2024Updated last year
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- ☆15Feb 23, 2026Updated 3 months ago