FamousDirector / FastWhisper
This is an optimized implementation of OpenAI's Whisper for multilingual transcription.
☆38Updated 2 years ago
Alternatives and similar repositories for FastWhisper
Users who are interested in FastWhisper are comparing it to the libraries listed below
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- Putting flows on top of neural transducers for better TTS☆64Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- ☆39Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆130Updated 5 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆176Updated this week
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆68Updated 2 months ago
- ☆145Updated last week
- Various speech datasets made available to the public☆131Updated 10 months ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆149Updated last year
- Official implementation of the TTS model Lina-Speech☆170Updated 9 months ago
- ☆358Updated last year
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆133Updated last year
- Speaker Diarization with Transformers☆69Updated 4 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- ☆56Updated 2 years ago
- ☆310Updated last year
- ☆87Updated 2 months ago
- ONNX Inference of Pyannote Segmentation☆94Updated 9 months ago
- Simple diarization model☆52Updated 4 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆326Updated 11 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆53Updated 2 years ago
- ☆37Updated 5 months ago
- Very fast, accurate speaker diarization☆150Updated last week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆127Updated 2 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last month
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated 2 years ago