Synopsis / whisper_aneLinks
☆24Updated 2 years ago
Alternatives and similar repositories for whisper_ane
Users that are interested in whisper_ane are comparing it to the libraries listed below
Sorting:
- ☆58Updated 2 years ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Updated 2 years ago
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆63Updated 2 years ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆119Updated last year
- Local ML voice chat using high-end models.☆181Updated last month
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆86Updated last year
- Find out why your CoreML model isn't running on the Neural Engine!☆30Updated last year
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆37Updated last year
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆25Updated 3 months ago
- FlashAttention (Metal Port)☆577Updated last year
- A Swift library that runs Alpaca prediction locally to implement ChatGPT like app on Apple platform devices.☆94Updated 2 years ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆235Updated 2 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆63Updated last year
- Try CoreML models on multiple images and videos easily and quickly☆42Updated 2 months ago
- ☆42Updated 7 months ago
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆132Updated last month
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.☆284Updated 7 months ago
- 8-bit CUDA functions for PyTorch☆18Updated last year
- example of using CoreML from c++☆24Updated 2 years ago
- mlx implementations of various transformers, speedups, training☆33Updated 2 years ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- mlx image models for Apple Silicon machines☆91Updated 2 months ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆178Updated 2 years ago
- LLM-based code completion engine☆189Updated last year
- This package provides Swift bindings for llama.cpp☆26Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- ☆128Updated 7 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- 1.58 Bit LLM on Apple Silicon using MLX☆240Updated last year