RobertRiachi / ANE-Optimized-Whisper-OpenAILinks
☆54Updated 2 years ago
Alternatives and similar repositories for ANE-Optimized-Whisper-OpenAI
Users that are interested in ANE-Optimized-Whisper-OpenAI are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆63Updated last year
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆116Updated 9 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆84Updated last year
- mlx image models for Apple Silicon machines☆85Updated 5 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆23Updated last year
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆34Updated last year
- Profile your CoreML models directly from Python 🐍☆28Updated last month
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆228Updated 2 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆26Updated last year
- example of using CoreML from c++☆23Updated 2 years ago
- FlashAttention (Metal Port)☆538Updated last year
- MLX support for the Open Neural Network Exchange (ONNX)☆59Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Updated 2 years ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆59Updated last year
- Python bindings for ggml☆146Updated last year
- ☆189Updated 6 months ago
- C API for MLX☆137Updated 2 weeks ago
- Local ML voice chat using high-end models.☆174Updated last month
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- OpenAI's Whisper ported to CoreML☆80Updated 2 years ago
- LLM-based code completion engine☆190Updated 8 months ago
- Try CoreML models on multiple images and videos easily and quickly☆42Updated last year
- Port of Meta's Encodec in C/C++☆222Updated 10 months ago
- LLaVA server (llama.cpp).☆183Updated last year
- WebGPU LLM inference tuned by hand☆150Updated 2 years ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆138Updated 3 years ago
- MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.☆210Updated last month
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)☆223Updated last year