DigitLib / whisper-webui-vad
A combined fork of two repos that enables the OpenAI Whisper large image with VAD on low-VRAM GPUs.
☆34 Updated 2 years ago
Alternatives and similar repositories for whisper-webui-vad:
Users who are interested in whisper-webui-vad are comparing it to the libraries listed below.
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. ☆108 Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆30 Updated last year
- ☆48 Updated last year
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ… ☆19 Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆78 Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation ☆113 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆91 Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆41 Updated last year
- ☆95 Updated 11 months ago
- ☆83 Updated 9 months ago
- A curated list of awesome OpenAI's Whisper ☆101 Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆118 Updated last year
- Coqui AI TTS plugin ☆74 Updated last month
- A testing repo to share code and thoughts on diarisation ☆55 Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆21 Updated last week
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning … ☆26 Updated last year
- web based editor for subtitles and transcripts ☆130 Updated 8 months ago
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS … ☆12 Updated last month
- RVC Onnx Infer- Upgraded and simplified-ish ☆21 Updated 11 months ago
- A browser interface based on the Gradio library for OpenAI's Whisper model. ☆40 Updated last year
- Code for OpenAI Whisper Web App Demo ☆93 Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆63 Updated last year
- A simple TTS server for generating speech using StyleTTS2 ☆38 Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆52 Updated last year
- canvas-based talking head model using viseme data ☆30 Updated last year
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | Multi-LLM adaptability | General-purpose large language model agent framework | Multi-persona, fully type-annotated ☆40 Updated last year
- ez audio transcription tool with flexible processing and post-processing options ☆149 Updated last year
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 8 months ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project ☆33 Updated last year