FamousDirector / FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆39 Updated 3 years ago
Alternatives and similar repositories for FastWhisper
Users who are interested in FastWhisper are comparing it to the libraries listed below.
- ☆40 Updated 4 years ago
- Whisper fine-tuning event script to use multiple hf datasets ☆32 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆65 Updated 2 weeks ago
- Use quantized versions of Whisper to speed up inference ☆12 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆106 Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆23 Updated 4 years ago
- ☆56 Updated 3 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆150 Updated 2 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆147 Updated 8 months ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 Updated 2 years ago
- ☆173 Updated this week
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 Updated 2 years ago
- Open TTS models, built for streaming on the edge ☆45 Updated 10 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 4 years ago
- A merged version of multiple open-source German speech datasets. ☆34 Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated 2 years ago
- ☆86 Updated 6 months ago
- Official implementation of the TTS model Lina-Speech ☆176 Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆31 Updated last year
- Various speech datasets made available to the public ☆130 Updated last year
- Convert English text from written expressions into spoken forms ☆28 Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT ☆58 Updated 2 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight. ☆78 Updated 3 weeks ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆189 Updated last week
- ONNX Inference of Pyannote Segmentation ☆97 Updated last year
- ☆37 Updated 2 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ☆63 Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆104 Updated 10 months ago