FamousDirector / FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38 Updated 2 years ago
Alternatives and similar repositories for FastWhisper:
Users who are interested in FastWhisper are comparing it to the libraries listed below.
- ☆38 Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆94 Updated 5 months ago
- Use quantized versions of Whisper to speed up inference ☆12 Updated 4 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆109 Updated 2 years ago
- ☆73 Updated this week
- Putting flows on top of neural transducers for better TTS ☆62 Updated last week
- python bindings for symphonia/opus - read various audio formats from python and write opus files ☆33 Updated last week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 Updated 6 months ago
- Finetuning VITS Efficiently ☆32 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 Updated 8 months ago
- Convert English text from written expressions into spoken forms ☆24 Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆82 Updated 2 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- OpenAI Whisper Prompt Examples ☆52 Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆59 Updated 3 weeks ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆153 Updated this week
- A Hackable speech recognition library. ☆25 Updated 4 months ago
- Whisper fine-tuning event script to use multiple hf datasets ☆32 Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 3 years ago
- Speaker Diarization with Transformers ☆64 Updated 9 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification. ☆54 Updated 3 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆47 Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles ☆144 Updated 9 months ago
- ☆56 Updated 2 years ago
- ☆22 Updated last month
- Forced alignment decoder for Whisper. ☆14 Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆51 Updated 7 months ago
- ☆34 Updated this week
- ☆19 Updated last year
- Fine-Tune Whisper with Transformers and PEFT ☆52 Updated last year