argmaxinc / WhisperKit
On-device Speech Recognition for Apple Silicon
☆4,127Updated this week
Alternatives and similar repositories for WhisperKit:
Users that are interested in WhisperKit are comparing it to the libraries listed below
- Examples using MLX Swift☆1,087Updated this week
- Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.☆4,298Updated 2 months ago
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,492Updated last month
- ML-powered speech recognition directly in your browser☆2,696Updated 3 months ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,699Updated last week
- Making the community's best AI chat models available to everyone.☆1,799Updated last month
- Inference and training library for high-quality TTS models.☆4,910Updated last month
- Foundational model for human-like, expressive TTS☆3,979Updated 5 months ago
- Mac app for Ollama☆1,423Updated last month
- An Open Source text-to-speech system built by inverting Whisper.☆4,080Updated last month
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se…☆2,980Updated 8 months ago
- Swift API for MLX☆781Updated this week
- 🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.☆667Updated 2 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,570Updated 5 months ago
- Examples in the MLX framework☆6,539Updated this week
- Build real-time multimodal AI applications 🤖🎙️📹☆4,588Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆5,400Updated 3 weeks ago
- Swift Package to implement a transformers-like API in Swift☆769Updated this week
- 🎤 The easiest way to transcribe audio in Swift☆622Updated 7 months ago
- ☆7,938Updated 7 months ago
- tiny vision language model☆6,732Updated this week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆8,063Updated this week
- A fast multimodal LLM for real-time voice☆2,760Updated this week
- Open Source framework for voice and multimodal conversational AI☆4,299Updated this week
- WhisperPlus: Faster, Smarter, and More Capable 🚀☆1,757Updated last week
- chat with private and local large language models☆936Updated this week
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX.☆615Updated 8 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,783Updated 4 months ago
- Local realtime voice AI☆2,162Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,775Updated 3 months ago