smpanaro / coreml-llm-cli
CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.
☆94 Updated 2 months ago
Alternatives and similar repositories for coreml-llm-cli:
Users who are interested in coreml-llm-cli are comparing it to the libraries listed below.
- Run transformers (incl. LLMs) on the Apple Neural Engine. ☆58 Updated last year
- Swift implementation of Flux.1 using mlx-swift ☆78 Updated 3 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ☆22 Updated 9 months ago
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation… ☆28 Updated 8 months ago
- ☆55 Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub. ☆12 Updated last year
- Swift Core ML Examples ☆205 Updated 3 months ago
- Implementation of F5-TTS in Swift using MLX ☆58 Updated 3 months ago
- A multi-platform SwiftUI frontend for running local LLMs with Apple's MLX framework. ☆391 Updated 4 months ago
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 Updated 8 months ago
- Run embedding models locally in Swift using MLTensor. ☆65 Updated 3 weeks ago
- Try CoreML models on multiple images and videos easily and quickly ☆38 Updated last year
- mlx image models for Apple Silicon machines ☆75 Updated 3 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆57 Updated 11 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆75 Updated last year
- Profile your CoreML models directly from Python 🐍 ☆27 Updated 4 months ago
- A minimalistic Swift implementation of the Jinja templating engine, specifically designed for parsing and rendering ML chat templates. ☆46 Updated last month
- MLX Model Manager unifies loading and inferencing with LLMs and VLMs. ☆83 Updated last month
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆105 Updated 4 months ago
- Print all known information about the GPU on Apple-designed chips ☆74 Updated 6 months ago
- Fork of Apple MLX swift example with addition of macOS SwiftUI App ☆52 Updated last year
- ☆40 Updated 9 months ago
- Swift library to work with llama and other large language models. ☆250 Updated last month
- FlashAttention (Metal Port) ☆449 Updated 5 months ago
- This package provides Swift bindings for llama.cpp ☆24 Updated last year
- ModernBERT model optimized for Apple Neural Engine. ☆23 Updated 2 months ago
- ☆144 Updated 2 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation ☆203 Updated last month