mzau / mlx-knife
Ollama-like CLI tool for MLX models on Hugging Face (pull, rm, list, show, serve, etc.)
☆120Updated last week
Alternatives and similar repositories for mlx-knife
Users that are interested in mlx-knife are comparing it to the libraries listed below
- MLX-GUI MLX Inference Server for Apple Silicon☆157Updated last week
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆69Updated last month
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆129Updated 2 weeks ago
- Train Large Language Models on MLX.☆236Updated last week
- Instant Perfect Native MacOS Transcription☆48Updated 4 months ago
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 5 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆236Updated last month
- powerful and fast tool calling agents☆79Updated 9 months ago
- For LLMs to better code with Jina API☆173Updated this week
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆18Updated 9 months ago
- ☆101Updated 6 months ago
- Metadspy: The framework for specifying—not programming—language models☆88Updated 6 months ago
- ☆107Updated last month
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)DoRA fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 6 months ago
- Personal project, Generative AI, Streamlit, Python☆54Updated 7 months ago
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- ☆78Updated last year
- ☆113Updated 5 months ago
- ☆90Updated 11 months ago
- ☆36Updated 10 months ago
- ☆188Updated 5 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆222Updated last month
- "Hey, Computer" from Star Trek. Talk to your agent. Run hooks after trigger commands. Runs locally, cause shit's scary.☆32Updated this week
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year
- ☆20Updated last month
- Deep research agents using MiniMax-M2 interleaved thinking☆143Updated 3 weeks ago
- Dabarqus is incredibly fast RAG that runs everywhere.☆59Updated 10 months ago
- ☆50Updated 4 months ago
- Thoughtful Lightning AI Assistant - Dual-engine system with DeepSeek reasoning and Groq inference, featuring Gradio UI, secure API manage…☆20Updated 11 months ago
- Distributed inference for MLX LLMs☆99Updated last year