sindresorhus / awesome-whisper
π Awesome list for Whisper β an open-source AI-powered speech recognition system developed by OpenAI
β1,579Updated 10 months ago
Alternatives and similar repositories for awesome-whisper:
Users that are interested in awesome-whisper are comparing it to the libraries listed below
- An Open Source text-to-speech system built by inverting Whisper.β4,166Updated 3 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.β996Updated last month
- β8,253Updated 9 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisperβ4,294Updated 2 weeks ago
- Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.β3,805Updated 2 months ago
- Transform audio-visual content into navigable knowledge.β785Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ376Updated 7 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationβ2,656Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionβ639Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface β‘οΈβ341Updated 9 months ago
- β1,118Updated last month
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.β4,578Updated 11 months ago
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokensβ482Updated last year
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JSβ837Updated 5 months ago
- A nearly-live implementation of OpenAI's Whisper.β2,628Updated last month
- A python package to build AI-powered real-time audio applicationsβ1,221Updated last month
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and β¦β247Updated last week
- ποΈ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants ποΈβ1,435Updated this week
- Real time transcription with OpenAI Whisper.β2,624Updated 9 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning modβ¦β468Updated this week
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChaβ¦β429Updated this week
- Real time speech to text transcription app.β400Updated 2 years ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,590Updated 8 months ago
- Cross-Platform, GPU Accelerated Whisper ποΈβ1,788Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β14,656Updated last week
- turnkey self-hosted offline transcription and diarization service with llm summaryβ827Updated 6 months ago
- Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)β719Updated last month
- Faster Whisper transcription with CTranslate2β15,079Updated last week
- β1,609Updated this week
- OpenAI Whisper ASR Webservice APIβ2,475Updated last month