ml-explore / mlxLinks
MLX: An array framework for Apple silicon
☆20,757Updated this week
Alternatives and similar repositories for mlx
Users that are interested in mlx are comparing it to the libraries listed below
Sorting:
- Examples in the MLX framework☆7,444Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆48,531Updated this week
- ☆8,623Updated 7 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆14,667Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,685Updated 3 weeks ago
- Tensor library for machine learning☆12,591Updated this week
- Python bindings for llama.cpp☆9,168Updated 3 weeks ago
- LLM inference in C/C++☆80,984Updated this week
- Go ahead and axolotl questions☆9,470Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆45,046Updated last week
- Inference code for CodeLlama models☆16,312Updated 9 months ago
- the AI-native open-source embedding database☆20,090Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆10,586Updated this week
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other…☆26,540Updated this week
- Large Language Model Text Generation Inference☆10,155Updated this week
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆39,558Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,174Updated this week
- Fast and memory-efficient exact attention☆17,572Updated last week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆28,257Updated 2 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆19,824Updated 2 months ago
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆16,619Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,355Updated this week
- DSPy: The framework for programming—not prompting—language models☆24,538Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆23,128Updated this week
- Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.☆5,350Updated 2 months ago
- Port of OpenAI's Whisper model in C/C++☆40,207Updated last week
- Official inference library for Mistral models☆10,262Updated 2 months ago
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability v…☆8,154Updated this week
- tiny vision language model☆8,019Updated last week
- Development repository for the Triton language and compiler☆15,687Updated this week