DigitLib / whisper-webui-vadLinks
This is the combined forks of two repos to enable OpenAI Whisper large image with VAD for low VRAM GPUs.
☆32Updated 2 years ago
Alternatives and similar repositories for whisper-webui-vad
Users that are interested in whisper-webui-vad are comparing it to the libraries listed below
Sorting:
- web based editor for subtitles and transcripts☆142Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆135Updated this week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- A testing repo to share code and thoughts on diarisation☆57Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆87Updated 3 months ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- ☆158Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- ☆83Updated last year
- A curated list of awesome OpenAI's Whisper☆101Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆159Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆66Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆167Updated 2 weeks ago
- openvino version of openai/whisper☆180Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- Speech to text to speech using Elevenlabs☆28Updated 2 years ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.☆64Updated 3 months ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆246Updated 4 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 5 months ago
- Transcription and diarization (speaker identification)☆34Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago