bungerr / faster-whisper-3
Faster Whisper transcription with CTranslate2
☆8Updated last year
Alternatives and similar repositories for faster-whisper-3:
Users that are interested in faster-whisper-3 are comparing it to the libraries listed below
- Faster distil-whisper transcription with CTranslate2☆13Updated last year
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆25Updated 8 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆34Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- whisper.cpp bindings for python☆94Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated last week
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆59Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 10 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆48Updated last week
- High quality text-to-speech based on StyleTTS 2.☆36Updated this week
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- This is an optimized implementation of OpenAI's Whisper for multilingual transcription.☆38Updated 2 years ago
- Your one-stop solution for voice dataset creation☆119Updated last year
- MB-iSTFT-VITS2(Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK☆12Updated last year
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆14Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Updated last year
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated last year
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 2 weeks ago
- Open TTS models, built for streaming on the edge☆39Updated last month
- Evaluation of STT models for german language☆15Updated 3 years ago
- Faster Whisper ASR transcription with CTranslate2☆20Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- ☆10Updated this week
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- a Frontier Japanese Speech Generation net☆31Updated last month
- Experiments with BitNet inference on CPU☆53Updated last year
- The Vokan Architecture (Tsukasa speech based)☆9Updated 2 months ago