AndreMarkert / whisper-webuiLinks
A browser interface based on the Gradio library for OpenAI's Whisper model.
☆42Updated 2 years ago
Alternatives and similar repositories for whisper-webui
Users that are interested in whisper-webui are comparing it to the libraries listed below
Sorting:
- ☆69Updated 8 months ago
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆18Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui☆154Updated last year
- A custom extension for AUTOMATIC1111/stable-diffusion-webui to extend rest APIs to do some local operations, using in StableStudio.☆47Updated 2 years ago
- An API for VoiceCraft.☆25Updated last year
- Site for sharing Bark voices☆51Updated 3 months ago
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆74Updated 11 months ago
- Docker image for the Text Generation Web UI: A Gradio web UI for Large Language Models with support for multiple inference backends.☆2Updated last week
- A multi-voice TTS system trained with an emphasis on quality☆26Updated 2 years ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated 11 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- Transcribe with ease :D☆16Updated 2 years ago
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output☆34Updated last year
- ☆83Updated 11 months ago
- ☆47Updated last year
- Oobabooga extension for Bark TTS☆119Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆96Updated 2 weeks ago
- ☆54Updated last year
- Audio datasets, easier.☆84Updated last year
- ☆67Updated 3 months ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆55Updated last year
- A SwarmUI extension that adds parameters for ReActor to the the generate tab☆22Updated last month
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆68Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆52Updated 3 weeks ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆70Updated 2 years ago
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆68Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 3 months ago