A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆85Aug 20, 2025Updated 8 months ago
Alternatives and similar repositories for mlx-pretrain
Users that are interested in mlx-pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Oct 18, 2025Updated 7 months ago
- Train Large Language Models on MLX.☆370May 8, 2026Updated last week
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- ☆14Apr 16, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- UMAP in pure MLX for Apple Silicon. 30x faster than umap-learn.☆43Mar 5, 2026Updated 2 months ago
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
- ☆82Mar 19, 2026Updated 2 months ago
- FastMLX is a high performance production ready API to host MLX models.☆356Mar 18, 2025Updated last year
- ☆15May 17, 2024Updated 2 years ago
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx☆33Mar 12, 2026Updated 2 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆384Updated this week
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆403Aug 15, 2025Updated 9 months ago
- Flash-MoE sidecar slot-bank runtime for large GGUF MoE models on Apple Silicon — llama.cpp fork☆99May 11, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple repository for training small reasoning models☆50Feb 17, 2026Updated 3 months ago
- Minimal Claude Code alternative powered by MLX☆46Jan 11, 2026Updated 4 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Jun 20, 2025Updated 10 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆225Nov 12, 2025Updated 6 months ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆76Mar 23, 2026Updated last month
- Introduction to MLX for Swift developers☆46Jun 23, 2025Updated 10 months ago
- A collection of optimizers for MLX☆58Dec 12, 2025Updated 5 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- ☆15Apr 26, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast parallel LLM inference for MLX☆249Jul 7, 2024Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆38Jun 21, 2024Updated last year
- The purpose of this repository is to discuss on Audio transformers☆14Apr 16, 2026Updated last month
- Distributed Inference for mlx LLm☆101Aug 1, 2024Updated last year
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆135Feb 27, 2026Updated 2 months ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆717May 9, 2026Updated last week
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆241Oct 28, 2025Updated 6 months ago
- Desktop frontend for Apple's Sharp model for monocular view synthesis☆17Dec 20, 2025Updated 5 months ago
- Letting Claude Code develop his own MCP tools :)☆121Mar 8, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 4 months ago
- Run Time Series Foundation Models on Apple Silicon☆32Feb 27, 2026Updated 2 months ago
- Deploy and scale Large Language Models (LLMs) in production.☆40Jul 20, 2024Updated last year
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆347Mar 3, 2025Updated last year
- This is a repo covers ai research papers pseudocodes☆17Jun 20, 2023Updated 2 years ago
- experiments with MLX☆68Dec 15, 2025Updated 5 months ago
- Smart reproducible analytical pipeline inspection☆21Feb 13, 2026Updated 3 months ago