FamousDirector / FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38 Updated 2 years ago
Alternatives and similar repositories for FastWhisper
Users who are interested in FastWhisper are comparing it to the libraries listed below.
- ☆25 Updated last week
- ☆38 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆62 Updated last week
- ☆103 Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆98 Updated 7 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ☆17 Updated 6 months ago
- Speaker Diarization with Transformers ☆64 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆51 Updated last week
- StyleTTS 2 Optimized Training Fork ☆29 Updated 4 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Use quantized versions of Whisper to speed up inference ☆12 Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆114 Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 Updated 3 weeks ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆52 Updated 2 years ago
- Tunable pipelines ☆34 Updated 3 months ago
- Open TTS models, built for streaming on the edge ☆43 Updated 2 months ago
- High quality text-to-speech based on StyleTTS 2. ☆48 Updated last week
- ☆79 Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆109 Updated 2 weeks ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆82 Updated 2 years ago
- ☆17 Updated 4 years ago
- Audio tokenization, in the fastest way possible! ☆52 Updated 9 months ago
- ☆63 Updated last year
- ☆36 Updated last month
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆51 Updated 9 months ago
- Simple diarization model ☆49 Updated last year
- ☆26 Updated 4 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆163 Updated 3 weeks ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆23 Updated 2 months ago