Robust Speech Recognition via Large-Scale Weak Supervision
★ 95,206 · Dec 15, 2025 · Updated 2 months ago
Alternatives and similar repositories for whisper
Users interested in whisper are comparing it to the libraries listed below.
- Port of OpenAI's Whisper model in C/C++ · ★ 47,067 · Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production · ★ 44,691 · Aug 16, 2024 · Updated last year
- Faster Whisper transcription with CTranslate2 · ★ 21,176 · Nov 19, 2025 · Updated 3 months ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. · ★ 163,632 · Updated this week
- LLM inference in C/C++ · ★ 96,322 · Updated this week
- 🦜🔗 The platform for reliable agents. · ★ 127,809 · Updated this week
- 🔊 Text-Prompted Generative Audio Model · ★ 39,006 · Aug 19, 2024 · Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) · ★ 20,368 · Feb 22, 2026 · Updated last week
- Stable Diffusion web UI · ★ 161,451 · Updated this week
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… · ★ 182,031 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… · ★ 157,071 · Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. · ★ 77,171 · May 27, 2025 · Updated 9 months ago
- LlamaIndex is the leading document agent and OCR platform · ★ 47,210 · Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. · ★ 104,246 · Updated this week
- A latent text-to-image diffusion model · ★ 72,575 · Jun 18, 2024 · Updated last year
- Inference code for Llama models · ★ 59,183 · Jan 26, 2025 · Updated last year
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) · ★ 125,513 · Updated this week
- Examples and guides for using the OpenAI API · ★ 71,720 · Feb 25, 2026 · Updated last week
- Instant voice cloning by MIT and MyShell. Audio foundation model. · ★ 36,025 · Apr 19, 2025 · Updated 10 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! · ★ 41,855 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs · ★ 71,883 · Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… · ★ 37,444 · Aug 17, 2024 · Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. · ★ 39,426 · Jun 2, 2025 · Updated 9 months ago
- The definitive Web UI for local AI, with powerful features and easy setup. · ★ 46,130 · Updated this week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… · ★ 176,657 · Updated this week
- A feature-rich command-line audio/video downloader · ★ 149,202 · Updated this week
- Production-ready platform for agentic workflow development. · ★ 130,750 · Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… · ★ 23,029 · Mar 13, 2025 · Updated 11 months ago
- A natural language interface for computers · ★ 62,427 · Feb 9, 2026 · Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. · ★ 54,071 · Nov 12, 2025 · Updated 3 months ago
- A programming framework for agentic AI · ★ 54,956 · Jan 22, 2026 · Updated last month
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. · ★ 53,029 · Updated this week
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. · ★ 18,086 · Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. · ★ 41,706 · Updated this week
- Making large AI models cheaper, faster and more accessible · ★ 41,359 · Feb 23, 2026 · Updated last week
- f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source - self-host for your organi… · ★ 148,271 · Updated this week
- SOTA Open Source TTS · ★ 25,078 · Feb 2, 2026 · Updated last month
- Interact with your documents using the power of GPT, 100% privately, no data leaks · ★ 57,143 · Updated this week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time · ★ 59,483 · Dec 15, 2025 · Updated 2 months ago