jbarrow / mlx-playgroundLinks
mlx implementations of various transformers, speedups, training
☆33Updated last year
Alternatives and similar repositories for mlx-playground
Users that are interested in mlx-playground are comparing it to the libraries listed below
Sorting:
- run embeddings in MLX☆93Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- Full finetuning of large language models without large memory requirements☆94Updated last week
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- ☆116Updated 9 months ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆15Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- Implementation of nougat that focuses on processing pdf locally.☆83Updated 8 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆112Updated last year
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆175Updated last year
- ☆67Updated last year
- LLaVA server (llama.cpp).☆182Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆272Updated last year
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆66Updated 10 months ago
- ☆38Updated last year
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆88Updated last year
- Distributed Inference for mlx LLm☆95Updated last year
- Cerule - A Tiny Mighty Vision Model☆68Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆31Updated 8 months ago
- ☆46Updated last year
- ☆135Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Gradio UI for a Cog API☆70Updated last year