mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆20Updated 5 months ago
Alternatives and similar repositories for faster-whisper:
Users that are interested in faster-whisper are comparing it to the libraries listed below
- Speaker diarization service☆21Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Audio tokenization, in the fastest way possible!☆50Updated 7 months ago
- Open TTS models, built for streaming on the edge☆39Updated last month
- zero-shot realtime TTS system, fully offline, free and open source☆34Updated 2 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- ☆10Updated last week
- ☆10Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Tunable pipelines☆32Updated last month
- ☆11Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆27Updated 3 weeks ago
- Create an LJSpeech structured voice dataset on wave input☆27Updated 6 months ago
- Use quantized versions of Whisper to speed up inference☆12Updated 6 months ago
- ☆26Updated 2 months ago
- Acoustic Neighbor Embeddings☆21Updated 4 months ago
- A curated list of awesome voice activity detection☆48Updated 4 months ago
- Speaker Diarization with Transformers☆64Updated 10 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆12Updated 5 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 3 years ago
- Experiments with BitNet inference on CPU☆53Updated last year
- ☆86Updated last week
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 8 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 7 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆23Updated this week
- ☆22Updated 3 years ago
- ☆20Updated 6 years ago