metame-ai / faster-distil-whisper
Faster distil-whisper transcription with CTranslate2
☆12Updated last year
Alternatives and similar repositories for faster-distil-whisper:
Users that are interested in faster-distil-whisper are comparing it to the libraries listed below
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆48Updated last month
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆114Updated 2 weeks ago
- ☆255Updated 10 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- This is an optimized implementation of OpenAI's Whisper for multilingual transcription.☆38Updated 2 years ago
- ☆65Updated 2 months ago
- web based editor for subtitles and transcripts☆118Updated 5 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆45Updated 2 years ago
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆101Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Official implementation of the TTS model Lina-Speech☆150Updated 3 weeks ago
- VoiceBox neural network implementation☆100Updated 5 months ago
- ☆90Updated 9 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆83Updated 9 months ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆126Updated last year
- Tooling to build datasets for audio model training☆16Updated last year
- Create an LJSpeech structured voice dataset on wave input☆24Updated 4 months ago
- G2P☆35Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆91Updated 8 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 3 years ago
- Enhancing Translation with RAG-Powered Large Language Models☆72Updated 2 weeks ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆58Updated 2 years ago
- ☆153Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆39Updated 2 months ago
- whisper.cpp bindings for python☆85Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2☆34Updated last year