Scripts to create your own MoE models using MLX
☆89Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for mlx-moe
Users that are interested in mlx-moe are comparing it to the libraries listed below.
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆352Mar 18, 2025Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated 11 months ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆81Feb 5, 2024Updated 2 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.☆97Feb 5, 2024Updated 2 years ago
- For inferring and serving local LLMs using the MLX framework☆114Mar 24, 2024Updated 2 years ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆284Jun 16, 2025Updated 9 months ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆262Oct 25, 2025Updated 5 months ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆180Jan 31, 2024Updated 2 years ago
- A simple LLaMA implementation using MLX.☆15Apr 22, 2024Updated last year
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆220Apr 6, 2026Updated last week
- ☆10Jul 6, 2023Updated 2 years ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆461Jan 29, 2025Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Feb 9, 2024Updated 2 years ago
- A simple Jupyter Notebook for learning MLX text-completion fine-tuning!☆124Nov 10, 2024Updated last year
- ☆81Mar 19, 2026Updated 3 weeks ago
- Start a server from the MLX library.☆200Jul 26, 2024Updated last year
- An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.☆1,594Sep 6, 2024Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Graph Neural Network library made for Apple Silicon☆210Mar 2, 2026Updated last month
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆63Apr 14, 2024Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated 2 years ago
- MLX image models for Apple Silicon machines☆95Updated this week
- Run large models from the terminal using Apple MLX.☆31Mar 18, 2024Updated 2 years ago
- ☆38Mar 12, 2024Updated 2 years ago
- A tiny server to run local inference on MLX models in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- Your gateway to both Ollama & Apple MLX models☆153Mar 2, 2025Updated last year
- A Next.js chatbot app demonstrating seamless integration with window.ai.☆15Jun 25, 2023Updated 2 years ago
- ☆213Apr 5, 2026Updated last week
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆275Jan 10, 2026Updated 3 months ago
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 3 months ago
- Friendly Terminal Assistant for Developers☆17Mar 23, 2024Updated 2 years ago
- Fast parallel LLM inference for MLX☆249Jul 7, 2024Updated last year
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.☆102Jun 29, 2025Updated 9 months ago
- ☆28Feb 9, 2024Updated 2 years ago
- Run Ollama & GGUF easily with a single command☆52May 15, 2024Updated last year