DigitLib / whisper-webui-vadLinks
This is the combined forks of two repos to enable OpenAI Whisper large image with VAD for low VRAM GPUs.
☆33Updated 2 years ago
Alternatives and similar repositories for whisper-webui-vad
Users that are interested in whisper-webui-vad are comparing it to the libraries listed below
Sorting:
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆125Updated this week
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- web based editor for subtitles and transcripts☆140Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- ☆99Updated last year
- Transcription and diarization (speaker identification)☆33Updated 2 years ago
- ☆40Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆65Updated 10 months ago
- ☆47Updated last year
- ☆83Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 9 months ago
- A lightweight end-to-end text-to-speech model☆119Updated 6 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- Coqui AI TTS plugin☆87Updated 2 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆47Updated 5 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆69Updated 2 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- whisper.cpp bindings for python☆101Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆155Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆234Updated last month
- Whisper combined with Silero VAD, for improved long-form transcriptions☆53Updated 2 years ago
- A curated list of awesome OpenAI's Whisper☆101Updated 2 years ago