mzbac / mlx_shardingLinks

Distributed Inference for mlx LLm

☆97

Alternatives and similar repositories for mlx_sharding

Users that are interested in mlx_sharding are comparing it to the libraries listed below

Sorting:

willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆223Updated last year
mzbac / mlx-llm-server
For inferring and serving local LLMs using the MLX framework
☆109Updated last year
teknium1 / ShareGPT-Builder
☆116Updated 10 months ago
Goekdeniz-Guelmez / mlx-lm-lora
Train Large Language Models on MLX.
☆196Updated 3 weeks ago
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆97Updated 3 months ago
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆42Updated 4 months ago
JosefAlbers / Phi-3-Vision-MLX
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
☆273Updated last year
QuixiAI / OpenChatML
☆162Updated 2 months ago
OoriData / Toolio
GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…
☆128Updated last month
remichu-ai / gallama
☆133Updated 6 months ago
da-z / mlx-ui
A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.
☆261Updated 4 months ago
armbues / SiLLM
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
☆278Updated 4 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 9 months ago
QuixiAI / kraken
☆67Updated last year
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
mzau / mlx-knife
ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)
☆108Updated last week
Jaykef / mlx-rag-gguf
Minimal, clean code implementation of RAG with mlx using gguf model weights
☆52Updated last year
mzbac / mlx-lora
☆38Updated last year
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84Updated 2 months ago
zhuzilin / faster-nougat
Implementation of nougat that focuses on processing pdf locally.
☆83Updated 9 months ago
arcee-ai / fastmlx
FastMLX is a high performance production ready API to host MLX models.
☆332Updated 7 months ago
adrienbrault / hf-gguf-to-ollama
Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.
☆118Updated last year
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆56Updated 11 months ago
chigkim / Ollama-MMLU-Pro
☆104Updated 2 months ago
mark-lord / MLX-text-completion-notebook
A simple Jupyter Notebook for learning MLX text-completion fine-tuning!
☆122Updated 11 months ago
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆224Updated last year
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆52Updated 8 months ago
Blaizzy / mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.
☆215Updated last month
stockeh / mlx-optimizers
A collection of optimizers for MLX
☆53Updated 2 weeks ago
mustafaaljadery / mlxserver
Start a server from the MLX library.
☆192Updated last year