openai / whisperLinks
Robust Speech Recognition via Large-Scale Weak Supervision
☆90,961Updated 2 months ago
Alternatives and similar repositories for whisper
Users that are interested in whisper are comparing it to the libraries listed below
Sorting:
- Port of OpenAI's Whisper model in C/C++☆44,532Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆18,773Updated 3 weeks ago
- Faster Whisper transcription with CTranslate2☆19,083Updated 2 weeks ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,703Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆8,687Updated last week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,640Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆93,673Updated this week
- The definitive Web UI for local AI, with powerful features and easy setup.☆45,387Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,250Updated 5 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆40,534Updated this week
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆56,821Updated last year
- ☆8,735Updated 3 weeks ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆45,244Updated this week
- Making large AI models cheaper, faster and more accessible☆41,237Updated last week
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.☆3,979Updated 10 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆43,445Updated last year
- 🔊 Text-Prompted Generative Audio Model☆38,713Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆63,144Updated this week
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆7,385Updated last week
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model☆9,887Updated last year
- Distribute and run LLMs with a single file.☆23,376Updated 2 weeks ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆16,100Updated this week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,667Updated 8 months ago
- ☆34,354Updated last year
- End-to-End Speech Processing Toolkit☆9,592Updated this week
- A PyTorch-based Speech Toolkit☆10,792Updated last week
- Open-source search and retrieval database for AI applications.☆24,388Updated this week
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆15,642Updated last week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆38,530Updated this week
- Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …☆28,386Updated last year