argmaxinc / WhisperKit
On-device Speech Recognition for Apple Silicon
☆4,291Updated 2 weeks ago
Alternatives and similar repositories for WhisperKit:
Users that are interested in WhisperKit are comparing it to the libraries listed below
- Examples using MLX Swift☆1,537Updated last week
- Examples in the MLX framework☆6,955Updated this week
- Swift API for MLX☆976Updated this week
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,595Updated 3 weeks ago
- 🎤 The easiest way to transcribe audio in Swift☆636Updated 8 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,741Updated last month
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆645Updated 9 months ago
- The open-source iOS app that's making quality voice transcription more accessible on mobile devices.☆796Updated 4 months ago
- CoreNet: A library for training deep neural networks☆7,002Updated 4 months ago
- ☆8,092Updated 8 months ago
- ML-powered speech recognition directly in your browser☆2,783Updated 4 months ago
- Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.☆4,856Updated 3 weeks ago
- Inference and training library for high-quality TTS models.☆5,017Updated 2 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆5,578Updated last month
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,578Updated 6 months ago
- MLX: An array framework for Apple silicon☆19,152Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,450Updated 6 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆7,506Updated last week
- A MLX port of FLUX based on the Huggingface Diffusers implementation.☆1,210Updated this week
- Use Ollama to talk to local LLMs in Apple Notes☆641Updated 4 months ago
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.☆698Updated 3 months ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,116Updated 2 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,564Updated 2 weeks ago
- A fast multimodal LLM for real-time voice☆3,589Updated this week
- chat with private and local large language models☆1,744Updated 2 weeks ago
- Swift Package to implement a transformers-like API in Swift☆837Updated this week
- Distributed LLM and StableDiffusion inference for mobile, desktop and server.☆2,779Updated 3 months ago
- On-device Diffusion Models for Apple Silicon☆581Updated 2 months ago
- AI wearables. Put it on, speak, transcribe, automatically☆4,163Updated this week
- Faster Whisper transcription with CTranslate2☆14,182Updated last month