mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆17Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for faster-whisper
- Speaker diarization service☆19Updated this week
- Easy tool that splits given audio based on speaker.☆11Updated 10 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆16Updated 3 weeks ago
- BurrMill core☆21Updated 3 years ago
- Open source cross-platform implementation of MRCP protocol☆18Updated 2 years ago
- Tunable pipelines☆30Updated last month
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆17Updated 3 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- A JAX library for building lattice-based speech transducer models☆40Updated 3 weeks ago
- Experiments with BitNet inference on CPU☆50Updated 7 months ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- proof of concept conversation orchestrator with a speech-language model☆14Updated last month
- ☆20Updated 6 years ago
- ☆9Updated last month
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- ☆12Updated 2 years ago
- On-device speaker diarization powered by deep learning☆25Updated this week
- ☆9Updated 4 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 4 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- ☆20Updated 3 weeks ago
- Normalize Text in Russian☆24Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 9 months ago
- Unofficial implementation of wavenext vocoder☆32Updated 2 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆15Updated last month
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆12Updated 2 months ago