mzau / mlx-knife
Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve, etc.)
☆101Updated this week
Alternatives and similar repositories for mlx-knife
Users who are interested in mlx-knife are comparing it to the libraries listed below
- MLX-GUI MLX Inference Server for Apple Silicon☆120Updated 3 weeks ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆95Updated 2 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆195Updated last week
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆129Updated last week
- Train Large Language Models on MLX.☆159Updated last month
- Dabarqus is incredibly fast RAG that runs everywhere.☆60Updated 7 months ago
- For LLMs to better code with Jina API☆166Updated this week
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year
- ☆113Updated 2 months ago
- powerful and fast tool calling agents☆55Updated 5 months ago
- ☆102Updated 3 months ago
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆18Updated 5 months ago
- ☆104Updated 2 months ago
- Distributed inference for MLX LLMs☆95Updated last year
- ☆78Updated 8 months ago
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆62Updated 6 months ago
- Personal project, Generative AI, Streamlit, Python☆54Updated 4 months ago
- Instant Perfect Native macOS Transcription☆47Updated last month
- FastMLX is a high performance production ready API to host MLX models.☆326Updated 5 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)DoRA fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 2 months ago
- Thoughtful Lightning AI Assistant - Dual-engine system with DeepSeek reasoning and Groq inference, featuring Gradio UI, secure API manage…☆20Updated 7 months ago
- ☆89Updated 7 months ago
- ☆72Updated this week
- ☆36Updated 7 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropic☆122Updated 6 months ago
- 🤖 Headless IDE for AI agents☆200Updated 4 months ago
- Automatic fine-tuning of models with synthetic data☆76Updated last year
- The easiest way to run the fastest MLX-based LLMs locally☆297Updated 10 months ago
- 9 separate websites IN SECONDS for you to chaotically edit!☆81Updated 3 weeks ago