ml-explore / mlx
MLX: An array framework for Apple silicon
☆22,755 Updated this week
Alternatives and similar repositories for mlx
Users who are interested in mlx are comparing it to the libraries listed below.
- Examples in the MLX framework ☆7,978 Updated last month
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆48,036 Updated this week
- ☆8,655 Updated last year
- Python bindings for llama.cpp ☆9,715 Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆62,548 Updated this week
- Distribute and run LLMs with a single file. ☆23,341 Updated last week
- Tensor library for machine learning ☆13,532 Updated this week
- LLM inference in C/C++ ☆89,278 Updated this week
- Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆18,022 Updated last week
- Go ahead and axolotl questions ☆10,753 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆20,075 Updated this week
- CoreNet: A library for training deep neural networks ☆7,025 Updated last month
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆155,718 Updated this week
- Inference Llama 2 in one file of pure C ☆18,912 Updated last year
- Inference code for CodeLlama models ☆16,360 Updated last year
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆16,472 Updated last month
- An Extensible Deep Learning Library ☆2,285 Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆12,069 Updated this week
- Fast and memory-efficient exact attention ☆20,414 Updated last week
- PyTorch native post-training library ☆5,576 Updated last week
- Ollama Python library ☆8,827 Updated last month
- Official inference library for Mistral models ☆10,531 Updated 7 months ago
- Lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,609 Updated this week
- llama3 implementation one matrix multiplication at a time ☆15,195 Updated last year
- Large Language Model Text Generation Inference ☆10,643 Updated this week
- Universal LLM Deployment Engine with ML Compilation ☆21,590 Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆114,494 Updated this week
- Open-source search and retrieval database for AI applications. ☆24,264 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.