ml-explore / mlx-examples
Examples in the MLX framework
☆7,751, updated 2 months ago
Alternatives and similar repositories for mlx-examples
Users who are interested in mlx-examples are comparing it to the libraries listed below.
- MLX: An array framework for Apple silicon ☆21,986, updated this week
- An Extensible Deep Learning Library ☆2,227, updated last week
- An all-in-one LLM chat UI for Apple silicon Macs using the MLX framework. ☆1,576, updated 11 months ago
- Examples using MLX Swift ☆2,001, updated 2 weeks ago
- On-device Speech Recognition for Apple Silicon ☆4,927, updated this week
- ☆8,646, updated 10 months ago
- Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python. ☆6,044, updated 4 months ago
- Run LLMs with MLX ☆1,666, updated this week
- CoreNet: A library for training deep neural networks ☆7,018, updated 3 months ago
- ☆3,005, updated 11 months ago
- Lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,541, updated this week
- PyTorch native post-training library ☆5,418, updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,640, updated this week
- Llama and other large language models running offline on iOS and macOS using the GGML library. ☆1,852, updated last week
- An MLX port of FLUX based on the Hugging Face Diffusers implementation. ☆1,521, updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,341, updated 5 months ago
- Tensor library for machine learning ☆13,017, updated last week
- Go ahead and axolotl questions ☆10,245, updated this week
- Training LLMs with QLoRA + FSDP ☆1,526, updated 9 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,859, updated last year
- Large Language Model Text Generation Inference ☆10,424, updated last week
- Making the community's best AI chat models available to everyone. ☆1,978, updated 6 months ago
- ☆4,086, updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆7,025, updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,082, updated 2 weeks ago
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,605, updated last week
- Modeling, training, eval, and inference code for OLMo ☆5,911, updated last week
- Blazingly fast LLM inference. ☆6,027, updated last week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,271, updated last month
- Inference Llama 2 in one file of pure 🔥 ☆2,117, updated last year