RobertRiachi / ANE-Optimized-Whisper-OpenAILinks
☆55Updated 2 years ago
Alternatives and similar repositories for ANE-Optimized-Whisper-OpenAI
Users that are interested in ANE-Optimized-Whisper-OpenAI are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆83Updated last year
- mlx image models for Apple Silicon machines☆82Updated 3 months ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆106Updated 6 months ago
- Profile your CoreML models directly from Python 🐍☆28Updated 9 months ago
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆61Updated last year
- Find out why your CoreML model isn't running on the Neural Engine!☆25Updated last year
- Try CoreML models on multiple images and videos easily and quickly☆41Updated last year
- LLaVA server (llama.cpp).☆180Updated last year
- LLM training in simple, raw C/Metal Shading Language☆57Updated last year
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆58Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- C API for MLX☆117Updated this week
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆52Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Updated 2 years ago
- Local ML voice chat using high-end models.☆174Updated last week
- Port of Meta's Encodec in C/C++☆226Updated 7 months ago
- LLM-based code completion engine☆193Updated 5 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆220Updated last week
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆22Updated last year
- MLX support for the Open Neural Network Exchange (ONNX)☆53Updated last year
- Python bindings for ggml☆142Updated 10 months ago
- FlashAttention (Metal Port)☆506Updated 9 months ago
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆33Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆118Updated 10 months ago
- Implementation of nougat that focuses on processing pdf locally.☆81Updated 6 months ago
- Explore a simple example of utilizing MLX for RAG application running locally on your Apple Silicon device.☆172Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 10 months ago