FamousDirector / FastWhisperLinks
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38Updated 2 years ago
Alternatives and similar repositories for FastWhisper
Users that are interested in FastWhisper are comparing it to the libraries listed below
Sorting:
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- ☆38Updated 3 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- ☆27Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆115Updated last month
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆170Updated last month
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- ☆37Updated 2 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆172Updated last year
- ☆104Updated 2 weeks ago
- Various speech datasets made available to the public☆123Updated 7 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆22Updated 10 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- A python package for deep multilingual punctuation prediction.☆127Updated 10 months ago
- A TTS model that makes a speaker speak new languages☆76Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 7 months ago
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- Use quantized versions of Whisper to speed up inference☆12Updated 9 months ago
- ☆81Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆68Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆104Updated 5 months ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Text-to-Speech Latency Benchmark☆16Updated 3 weeks ago
- ☆46Updated 2 years ago
- Official implementation of the TTS model Lina-Speech☆166Updated 6 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year