e-maalouly / Transcription-whisper_pyannote
☆17 Updated 2 years ago
Alternatives and similar repositories for Transcription-whisper_pyannote:
Users who are interested in Transcription-whisper_pyannote are comparing it to the libraries listed below.
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆45 Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options ☆140 Updated 11 months ago
- streaming speech to text server using Whisper ☆85 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆91 Updated 8 months ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization ☆25 Updated 2 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆138 Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆107 Updated last year
- web based editor for subtitles and transcripts ☆119 Updated 5 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆100 Updated this week
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆77 Updated last year
- A curated list of awesome OpenAI's Whisper ☆96 Updated last year
- Convert epub file to txt ☆28 Updated last year
- Simple Diarization model ☆46 Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆294 Updated 2 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆42 Updated last year
- ☆35 Updated 2 years ago
- OpenAI Whisper Prompt Examples ☆49 Updated last year
- openvino version of openai/whisper ☆164 Updated last year
- 📈 A forced aligner intended for synchronization of narrated text ☆87 Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆53 Updated last month
- ☆154 Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆108 Updated last year
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆76 Updated last month
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated 10 months ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 8 months ago
- Tunable pipelines ☆31 Updated last week
- Google's SoundStorm: Efficient Parallel Audio Generation ☆129 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆47 Updated last month
- A TextTiling-based algorithm for text segmentation (aka topic segmentation) that uses neural sentence encoders, as well as extractive sum… ☆44 Updated last year