smpanaro / more-ane-transformers
Run transformers (incl. LLMs) on the Apple Neural Engine.
☆61 Updated last year
Alternatives and similar repositories for more-ane-transformers
Users who are interested in more-ane-transformers are comparing it to the libraries listed below.
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. ☆106 Updated 6 months ago
- 1.58 Bit LLM on Apple Silicon using MLX ☆214 Updated last year
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ☆22 Updated last year
- ☆180 Updated 4 months ago
- ☆23 Updated 2 years ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆273 Updated last month
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video ☆58 Updated last year
- FlashAttention (Metal Port) ☆506 Updated 9 months ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub. ☆13 Updated 2 years ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆189 Updated last month
- run embeddings in MLX ☆90 Updated 9 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation ☆220 Updated last week
- ☆55 Updated 2 years ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. ☆447 Updated 5 months ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX. ☆179 Updated last month
- ☆75 Updated 7 months ago
- ☆292 Updated 3 months ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device. ☆172 Updated last year
- Local ML voice chat using high-end models. ☆174 Updated last week
- FastMLX is a high performance production ready API to host MLX models. ☆313 Updated 3 months ago
- mlx image models for Apple Silicon machines ☆82 Updated 3 months ago
- ☆104 Updated 3 weeks ago
- MLX Model Manager unifies loading and inferencing with LLMs and VLMs. ☆95 Updated 5 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆83 Updated last year
- For inferring and serving local LLMs using the MLX framework ☆105 Updated last year
- Fast parallel LLM inference for MLX ☆198 Updated last year
- Large Language Model (LLM) module for the Spezi Ecosystem ☆243 Updated last week
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit. ☆257 Updated last month
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 6 months ago
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 Updated last year