FamousDirector / FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38Updated 2 years ago
Alternatives and similar repositories for FastWhisper
Users who are interested in FastWhisper are comparing it to the libraries listed below.
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 11 months ago
- ☆39Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆64Updated this week
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆127Updated 4 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆176Updated this week
- A model that predicts the punctuation of English, Italian, French and German texts.☆78Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆87Updated 5 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆178Updated last year
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆131Updated last year
- Official implementation of the TTS model Lina-Speech☆170Updated 8 months ago
- ☆56Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- ☆128Updated last week
- Simple diarization model☆52Updated 3 months ago
- Open TTS models, built for streaming on the edge☆43Updated 6 months ago
- ☆37Updated 5 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 4 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆149Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆257Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆164Updated last year
- OpenVINO version of openai/whisper☆175Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last week
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆40Updated this week
- OpenAI Whisper Prompt Examples☆52Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 4 years ago