argmaxinc / WhisperKit
On-device Speech Recognition for Apple Silicon
☆3,937Updated last week
Related projects
Alternatives and complementary repositories for WhisperKit
- Examples using MLX Swift☆1,023Updated this week
- Examples in the MLX framework☆6,235Updated last week
- llama and other large language models on iOS and macOS offline using the GGML library.☆1,366Updated this week
- An Open Source text-to-speech system built by inverting Whisper.☆3,982Updated 5 months ago
- Making the community's best AI chat models available to everyone.☆1,570Updated 3 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,615Updated 3 weeks ago
- Enchanted is an iOS and macOS app for chatting with private self-hosted language models such as Llama2, Mistral or Vicuna using Ollama.☆3,710Updated last week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.☆4,827Updated 3 months ago
- Inference and training library for high-quality TTS models.☆4,658Updated 3 weeks ago
- Foundational model for human-like, expressive TTS☆3,895Updated 3 months ago
- 🎤 The easiest way to transcribe audio in Swift☆598Updated 5 months ago
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for macOS based on large language models.☆633Updated 2 weeks ago
- Open Source framework for voice and multimodal conversational AI☆3,385Updated this week
- ☆6,781Updated 3 weeks ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.☆1,547Updated 3 months ago
- first base model for full-duplex conversational audio☆1,560Updated last week
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,645Updated 4 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,660Updated last month
- MLX: An array framework for Apple silicon☆17,330Updated this week
- Build real-time multimodal AI applications 🤖🎙️📹☆4,010Updated this week
- Swift API for MLX☆670Updated last week
- Swift Package to implement a transformers-like API in Swift☆719Updated 2 weeks ago
- ☆7,741Updated 5 months ago
- ML-powered speech recognition directly in your browser☆2,581Updated last month
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se…☆2,946Updated 6 months ago
- The open-source iOS app that's making quality voice transcription more accessible on mobile devices.☆748Updated last month
- Local realtime voice AI☆1,946Updated this week
- Swift app demonstrating Core ML Stable Diffusion☆2,577Updated 4 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆4,962Updated 3 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,183Updated this week