DigitLib / whisper-webui-vadLinks

This is the combined forks of two repos to enable OpenAI Whisper large image with VAD for low VRAM GPUs.

☆33

Alternatives and similar repositories for whisper-webui-vad

Users that are interested in whisper-webui-vad are comparing it to the libraries listed below

Sorting:

winstxnhdw / nllb-api
A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.
☆120Updated this week
geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆156Updated last year
geekodour / wscribe-editor
web based editor for subtitles and transcripts
☆137Updated 11 months ago
mirix / approaches-to-diarisation
A testing repo to share code and thoughts on diarisation
☆55Updated last year
awexandrr / audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
☆119Updated last year
NeonGeckoCom / neon-tts-plugin-coqui
Coqui AI TTS plugin
☆85Updated last month
JonathanFly / faster-whisper-livestream-translator
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆79Updated 2 years ago
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆117Updated 2 years ago
camenduru / coqui-XTTS-colab
☆83Updated last year
ancs21 / awesome-openai-whisper
A curated list of awesome OpenAI's Whisper
☆101Updated last year
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆161Updated last year
nalbion / whisper-server
streaming speech to text server using Whisper
☆94Updated 2 years ago
amrrs / openai-whisper-webapp
Code for OpenAI Whisper Web App Demo
☆93Updated 2 years ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆223Updated 3 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96Updated last year
zhuzilin / whisper-openvino
openvino version of openai/whisper
☆170Updated last year
coqui-ai / STT-models
Open models for Coqui STT
☆141Updated 2 years ago
Mastering-Python-GT / Transcription-diarization-whisper-pyannote
Transcription and diarization (speaker identification)
☆33Updated 2 years ago
camenduru / whisper-jax-colab
☆47Updated last year
Vaibhavs10 / translate-with-whisper
☆158Updated 2 years ago
ANonEntity / WhisperWithVAD
Whisper combined with Silero VAD, for improved long-form transcriptions
☆52Updated 2 years ago
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆189Updated last year
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆120Updated last year
coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆222Updated last year
EtienneAb3d / WhisperTimeSync
Synchronize Whisper's timestamps over an existing accurate transcription
☆154Updated last year
fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆43Updated last year
coqui-ai / xtts-streaming-server
☆337Updated last year
Wordcab / wordcab-transcribe
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆215Updated 9 months ago
ProjectEGU / whisper-for-low-vram
Robust Speech Recognition via Large-Scale Weak Supervision
☆29Updated last year
carloscdias / whisper-cpp-python
whisper.cpp bindings for python
☆100Updated last year