e-maalouly / Transcription-whisper_pyannote
☆17Updated 2 years ago
Alternatives and similar repositories for Transcription-whisper_pyannote:
Users that are interested in Transcription-whisper_pyannote are comparing it to the libraries listed below
- ez audio transcription tool with flexible processing and post-processing options☆142Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆46Updated 2 years ago
- web based editor for subtitles and transcripts☆118Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆109Updated last year
- Convert epub file to txt☆30Updated last year
- openvino version of openai/whisper☆165Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated last year
- streaming speech to text server using Whisper☆86Updated last year
- ☆35Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelization☆52Updated last year
- whisper.cpp bindings for python☆86Updated last year
- A full-text search for YouTube subtitles and video metadata with a command line interface.☆28Updated last week
- ONNX implementation of Whisper. PyTorch free.☆92Updated 2 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- generate granular word-level captions in srt format☆57Updated 2 years ago
- A quick experiment to achieve almost realtime transcription using Whisper.☆186Updated 2 years ago
- Simple Diarization model☆47Updated last year
- Package for inference for punctuation, true-casing, and sentence boundary detection☆24Updated 8 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- ☆21Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint.☆83Updated last week
- Speaker Diarization with Transformers☆64Updated 8 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Buildings block for voice-enabled applications in the browser☆34Updated last week
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Model☆57Updated last year