FamousDirector / FastWhisperLinks
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆39Updated 3 years ago
Alternatives and similar repositories for FastWhisper
Users that are interested in FastWhisper are comparing it to the libraries listed below
Sorting:
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- ☆40Updated 4 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 3 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆64Updated last month
- A python package for deep multilingual punctuation prediction.☆153Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆146Updated 8 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated 2 years ago
- Various speech datasets made available to the public☆130Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆104Updated last year
- ☆56Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Updated 4 years ago
- ☆158Updated last month
- ☆37Updated last month
- ☆87Updated 5 months ago
- Official implementation of the TTS model Lina-Speech☆175Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆103Updated 9 months ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆79Updated last week
- Repository contains code to fine-tune WhisperASR model☆23Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆185Updated last week
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆189Updated last year
- ☆320Updated last year
- openvino version of openai/whisper☆180Updated 2 years ago