DigitLib / whisper-webui-vadLinks
This is the combined forks of two repos to enable OpenAI Whisper large image with VAD for low VRAM GPUs.
☆32Updated 2 years ago
Alternatives and similar repositories for whisper-webui-vad
Users that are interested in whisper-webui-vad are comparing it to the libraries listed below
Sorting:
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆138Updated last week
- web based editor for subtitles and transcripts☆143Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated 2 years ago
- whisper.cpp bindings for python☆110Updated 2 years ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆42Updated 2 years ago
- streaming speech to text server using Whisper☆101Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 6 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago
- ☆75Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆66Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- ☆83Updated last year
- ☆47Updated 2 years ago
- Coqui AI TTS plugin☆85Updated 7 months ago
- A curated list of awesome OpenAI's Whisper☆102Updated 2 years ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- ☆100Updated last year
- openvino version of openai/whisper☆182Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆30Updated 8 months ago
- Record audio or transcribe files using ctranslate2 and whisper!☆170Updated this week
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | 具有多 LLM 适应性 | 通用大语言模型代理端框架 |多人称全类型注解☆40Updated 2 years ago
- A browser interface based on the Gradio library for OpenAI's Whisper model.☆43Updated 2 years ago