bungerr / faster-whisper-3Links

Faster Whisper transcription with CTranslate2

☆8

Alternatives and similar repositories for faster-whisper-3

Users that are interested in faster-whisper-3 are comparing it to the libraries listed below

Sorting:

metame-ai / faster-distil-whisper
Faster distil-whisper transcription with CTranslate2
☆14Updated last year
taylorchu / 2cent-tts
☆22Updated this week
mesolitica / vllm-whisper
A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper
☆28Updated 11 months ago
ag1988 / mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆19Updated 8 months ago
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆76Updated last year
alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆52Updated last year
gooofy / zerovox
zero-shot realtime TTS system, fully offline, free and open source
☆41Updated 2 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆99Updated 8 months ago
aholab / AhoTTS
Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…
☆17Updated last year
speechmatics / ctranslate2_triton_backend
Triton backend for https://github.com/OpenNMT/CTranslate2
☆35Updated last year
EMRAI / emrai-synthetic-diarization-corpus
☆20Updated 6 years ago
taresh18 / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆90Updated last month
jumon / zac
Zero-shot Audio Classification using Whisper
☆79Updated 2 years ago
KoljaB / stream2sentence
Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.
☆63Updated last week
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆25Updated last year
leohuang2013 / pyannote-audio_speaker-diarization_cpp
C++ version of pyannote audio speaker diarizaiton pipeline
☆21Updated last year
ccoreilly / deepspeech-catala
Deepspeech ASR Model for the Catalan Language
☆17Updated 4 years ago
indri-voice / audiotoken
Audio tokenization, in the fastest way possible!
☆52Updated 10 months ago
speechcatcher-asr / speechcatcher-data
☆11Updated 3 weeks ago
mobiusml / faster-whisper
Faster Whisper ASR transcription with CTranslate2
☆22Updated 8 months ago
thorstenMueller / Audio-to-Voice-Dataset
Create an LJSpeech structured voice dataset on wave input
☆30Updated 9 months ago
Leikoe / torch_to_ggml
convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible
☆14Updated last year
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆31Updated 4 months ago
kdrkdrkdr / JA2ML-VITS
Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)
☆3Updated last year
ANonEntity / WhisperWithVAD
Whisper combined with Silero VAD, for improved long-form transcriptions
☆52Updated 2 years ago
skrbnv / javad
☆56Updated 5 months ago
jamesparsloe / llm.speech
Trying to build an all in one speech-text language model - a bit like GPT-4o
☆22Updated last year
hitz-zentroa / whisper-lm
Add n-gram and large language model (LLM) support to Whisper models.
☆26Updated last month
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43Updated 3 months ago
dscripka / openSpeechToIntent
A simple, but performant framework for mapping speech directly to categories and intents.
☆20Updated 10 months ago