smpanaro / more-ane-transformersLinks
Run transformers (incl. LLMs) on the Apple Neural Engine.
☆63Updated 2 years ago
Alternatives and similar repositories for more-ane-transformers
Users that are interested in more-ane-transformers are comparing it to the libraries listed below
Sorting:
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆117Updated 11 months ago
- ☆57Updated 2 years ago
- FlashAttention (Metal Port)☆560Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Updated 2 years ago
- ☆194Updated 8 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆234Updated last month
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆233Updated last month
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆24Updated last month
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆285Updated 5 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆226Updated last year
- ☆24Updated 2 years ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆206Updated 6 months ago
- run embeddings in MLX☆96Updated last year
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆179Updated last year
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆458Updated 10 months ago
- Local ML voice chat using high-end models.☆178Updated last month
- FastMLX is a high performance production ready API to host MLX models.☆337Updated 8 months ago
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆35Updated last year
- ☆76Updated last year
- Fast parallel LLM inference for MLX☆234Updated last year
- Print all known information about the GPU on Apple-designed chips☆94Updated last month
- Swift implementation of Flux.1 using mlx-swift☆112Updated 4 months ago
- ☆302Updated 7 months ago
- A tool which checks compatibility of CoreML model with Apple Neural Engine☆13Updated 3 years ago
- For inferring and serving local LLMs using the MLX framework☆108Updated last year
- C API for MLX☆155Updated last week
- Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon☆275Updated last month
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆62Updated last year
- Find out why your CoreML model isn't running on the Neural Engine!☆28Updated last year
- mlx image models for Apple Silicon machines☆87Updated 2 weeks ago