mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆20Updated 6 months ago
Alternatives and similar repositories for faster-whisper:
Users that are interested in faster-whisper are comparing it to the libraries listed below
- ☆24Updated last week
- Open TTS models, built for streaming on the edge☆41Updated last month
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆48Updated 3 weeks ago
- Speaker diarization service☆21Updated 3 weeks ago
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆35Updated 3 weeks ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆24Updated 2 weeks ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 5 months ago
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 6 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆49Updated 2 years ago
- Open-source and reproducible benchmarks for Speaker Diarization☆23Updated 3 weeks ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆101Updated 3 weeks ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- Collection of Open Source Speech Data☆153Updated 6 months ago
- ☆53Updated 9 months ago
- ☆11Updated 2 months ago
- ☆210Updated last month
- ☆25Updated last month
- ☆78Updated last year
- Speaker Diarization with Transformers☆64Updated 11 months ago
- Experiments with BitNet inference on CPU☆54Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆35Updated 2 weeks ago
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆14Updated 2 weeks ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year