A wannabe Ollama equivalent for Apple MlX models
☆84Mar 2, 2025Updated last year
Alternatives and similar repositories for PyOMlx
Users that are interested in PyOMlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Your gateway to both Ollama & Apple MlX models☆153Mar 2, 2025Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Apr 27, 2024Updated 2 years ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆263Oct 25, 2025Updated 7 months ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆129Dec 27, 2024Updated last year
- 🧠 Retrieval Augmented Generation (RAG) example☆19Apr 17, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Feb 5, 2024Updated 2 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated 2 years ago
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.☆831Mar 12, 2025Updated last year
- High-performance MLX-based LLM inference engine for macOS with native Swift implementation☆565Updated this week
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆21Nov 11, 2024Updated last year
- ☆38Mar 12, 2024Updated 2 years ago
- The easiest way to run the fastest MLX-based LLMs locally☆327Oct 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example Optimizely clone created with GPT Pilot☆29Nov 7, 2024Updated last year
- ☆54Sep 18, 2024Updated last year
- Structured output generation in Swift☆74Apr 6, 2026Updated 2 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆5,008Updated this week
- User-friendly launchctl wrapper and helper functions☆25Feb 19, 2025Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆89Feb 11, 2024Updated 2 years ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆463Jan 29, 2025Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆285Jun 16, 2025Updated 11 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆64Apr 14, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ⚾ Data science project to predict if a Major League Baseball player will get a hit on any given day ⚾☆10Apr 11, 2023Updated 3 years ago
- LM Studio Apple MLX engine☆1,071Updated this week
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆29May 6, 2025Updated last year
- A screenshotting tool for thinking clearly☆17May 15, 2024Updated 2 years ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆181Jan 31, 2024Updated 2 years ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Declarative TabularData creation for Swift - Convert objects to DataFrames with type-safe, SwiftUI-like syntax☆46Jun 6, 2025Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated last year
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆227May 25, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- Screenshot alternative for MacOS's Shift+Cmd+4. 1 click to save to a common location / copy / icloud. 2 Clicks to share with someone☆29May 25, 2024Updated 2 years ago
- ☆13Sep 6, 2021Updated 4 years ago
- MLX implementation of GCN, with benchmark on MPS, CUDA and CPU (M1 Pro, M2 Ultra, M3 Max).☆25Dec 16, 2023Updated 2 years ago
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Apr 18, 2024Updated 2 years ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Sep 10, 2024Updated last year
- MCP is a command-line tool and local UI for discovering, installing and managing Model Context Protocol servers.☆14Dec 28, 2024Updated last year