RobertRiachi / ANE-Optimized-Whisper-OpenAI
☆55 Updated 2 years ago
Alternatives and similar repositories for ANE-Optimized-Whisper-OpenAI
Users who are interested in ANE-Optimized-Whisper-OpenAI are comparing it to the libraries listed below.
- ☆23 Updated 2 years ago
- Run transformers (incl. LLMs) on the Apple Neural Engine. ☆63 Updated last year
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine. ☆116 Updated 8 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆84 Updated last year
- Profile your CoreML models directly from Python 🐍 ☆28 Updated 2 weeks ago
- MLX image models for Apple Silicon machines ☆84 Updated 5 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model. ☆23 Updated last year
- Python tools for WhisperKit: Model conversion, optimization and evaluation ☆227 Updated last month
- Example of using CoreML from C++ ☆24 Updated 2 years ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub. ☆13 Updated 2 years ago
- Find out why your CoreML model isn't running on the Neural Engine! ☆26 Updated last year
- FlashAttention (Metal Port) ☆534 Updated 11 months ago
- Local ML voice chat using high-end models. ☆175 Updated 3 weeks ago
- LLM training in simple, raw C/Metal Shading Language ☆57 Updated last year
- OpenAI's Whisper ported to CoreML ☆81 Updated 2 years ago
- MLX Swift implementation of Andrej Karpathy's "Let's build GPT" video ☆59 Updated last year
- LLM-based code completion engine ☆192 Updated 7 months ago
- MLX support for the Open Neural Network Exchange (ONNX) ☆57 Updated last year
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆137 Updated 3 years ago
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation… ☆34 Updated last year
- C API for MLX ☆132 Updated 2 weeks ago
- Python bindings for ggml ☆146 Updated last year
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64) ☆224 Updated last year
- LLaVA server (llama.cpp). ☆182 Updated last year
- Export Hugging Face models to Core ML and TensorFlow Lite ☆675 Updated last year
- Port of Meta's Encodec in C/C++ ☆226 Updated 9 months ago
- Run embeddings in MLX ☆92 Updated 11 months ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. ☆454 Updated 7 months ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA. ☆196 Updated 3 months ago
- ☆185 Updated 6 months ago