openai / whisperLinks
Robust Speech Recognition via Large-Scale Weak Supervision
☆91,747Updated 3 months ago
Alternatives and similar repositories for whisper
Users that are interested in whisper are comparing it to the libraries listed below
Sorting:
- Faster Whisper transcription with CTranslate2☆19,376Updated 3 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆19,025Updated last month
- Port of OpenAI's Whisper model in C/C++☆44,967Updated this week
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆8,789Updated this week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆43,710Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,714Updated last year
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆157,308Updated this week
- LLM inference in C/C++☆90,838Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆95,939Updated this week
- 🔊 Text-Prompted Generative Audio Model☆38,787Updated last year
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆15,786Updated this week
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆35,566Updated 7 months ago
- ☆8,746Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆51,821Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆76,955Updated 6 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆117,009Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆39,865Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆7,512Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆50,560Updated 3 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆16,717Updated 2 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆13,727Updated 2 months ago
- Open-source search and retrieval database for AI applications.☆24,734Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆64,758Updated this week
- A PyTorch-based Speech Toolkit☆10,886Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆45,661Updated this week
- A latent text-to-image diffusion model☆71,955Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,233Updated 2 weeks ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆40,842Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆42,090Updated 5 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,745Updated 8 months ago