smpanaro / more-ane-transformers
Run transformers (incl. LLMs) on the Apple Neural Engine.
☆61 Updated last year
Alternatives and similar repositories for more-ane-transformers
Users who are interested in more-ane-transformers are comparing it to the libraries listed below.
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. ☆104 Updated 5 months ago
- ☆23 Updated 2 years ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub. ☆13 Updated 2 years ago
- ☆54 Updated 2 years ago
- ☆174 Updated 2 months ago
- FlashAttention (Metal Port) ☆492 Updated 8 months ago
- mlx image models for Apple Silicon machines ☆80 Updated last month
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆57 Updated last year
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ☆22 Updated last year
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆165 Updated last week
- Swift implementation of Flux.1 using mlx-swift ☆81 Updated 5 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆268 Updated last week
- 1.58 Bit LLM on Apple Silicon using MLX ☆212 Updated last year
- ☆94 Updated 2 months ago
- Fast parallel LLM inference for MLX ☆189 Updated 10 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆81 Updated last year
- Local ML voice chat using high-end models. ☆167 Updated last week
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 Updated 11 months ago
- Try CoreML models on multiple images and videos easily and quickly ☆39 Updated last year
- MLX Model Manager unifies loading and inferencing with LLMs and VLMs. ☆94 Updated 4 months ago
- Print all known information about the GPU on Apple-designed chips ☆79 Updated 9 months ago
- For inferring and serving local LLMs using the MLX framework ☆104 Updated last year
- LLM training in simple, raw C/Metal Shading Language ☆54 Updated last year
- Python tools for WhisperKit: Model conversion, optimization and evaluation ☆216 Updated 2 weeks ago
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation… ☆30 Updated 11 months ago
- Run embedding models locally in Swift using MLTensor. ☆92 Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆23 Updated 4 months ago
- mlx implementations of various transformers, speedups, training ☆34 Updated last year
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆181 Updated last month
- Minimal, clean code implementation of RAG with mlx using gguf model weights ☆50 Updated last year