FamousDirector / FastWhisperLinks
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38Updated 2 years ago
Alternatives and similar repositories for FastWhisper
Users that are interested in FastWhisper are comparing it to the libraries listed below
Sorting:
- ☆38Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆117Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆121Updated 3 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆171Updated 2 weeks ago
- ☆120Updated last week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- Convert English text from written expressions into spoken forms☆26Updated 3 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆61Updated 2 years ago
- ☆56Updated 2 years ago
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 2 months ago
- Various speech datasets made available to the public☆128Updated 8 months ago
- ☆28Updated 2 months ago
- Simple diarization model☆52Updated 2 months ago
- Batch Support for OpenAI Whisper☆95Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆148Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A Hackable speech recognition library.☆25Updated 10 months ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆159Updated last week
- Use quantized versions of Whisper to speed up inference☆12Updated 10 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆58Updated 8 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆74Updated 4 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆176Updated last year
- ☆84Updated 3 weeks ago
- A TTS model that makes a speaker speak new languages☆76Updated last year