grorge123 / mlx-llm.cppLinks
☆23Updated 2 weeks ago
Alternatives and similar repositories for mlx-llm.cpp
Users that are interested in mlx-llm.cpp are comparing it to the libraries listed below
Sorting:
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆59Updated last year
- Swift implementation of Flux.1 using mlx-swift☆104Updated 3 weeks ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆52Updated last year
- Implementation of nougat that focuses on processing pdf locally.☆81Updated 7 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆193Updated 2 weeks ago
- MLX Model Manager unifies loading and inferencing with LLMs and VLMs.☆98Updated 7 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆83Updated last year
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆93Updated 2 months ago
- ☆116Updated 2 months ago
- ☆14Updated last month
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆95Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆18Updated 5 months ago
- ☆23Updated 10 months ago
- Something similar to Apple Intelligence?☆61Updated last year
- run embeddings in MLX☆90Updated 11 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆129Updated this week
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆22Updated 2 months ago
- Local ML voice chat using high-end models.☆175Updated last week
- mlx image models for Apple Silicon machines☆84Updated 4 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 8 months ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆93Updated this week
- Distributed Inference for mlx LLm☆93Updated last year
- A collection of optimizers for MLX☆52Updated 3 weeks ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆174Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 11 months ago
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Updated last year
- CLI tool for text to image generation using the FLUX.1 model.☆64Updated 2 months ago
- Qwen Image models through MPS☆171Updated last week