AndreMarkert / whisper-webui

A browser interface based on the Gradio library for OpenAI's Whisper model.

☆42

Alternatives and similar repositories for whisper-webui

Users who are interested in whisper-webui are comparing it to the repositories listed below.

geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆155 Updated last year
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆117 Updated 2 years ago
themanyone / whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
☆256 Updated last month
rudymohammadbali / Whisper-Transcriber
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.
☆56 Updated 11 months ago
awexandrr / audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
☆119 Updated last year
BBC-Esq / Faster-Whisper-Transcriber
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
☆133 Updated this week
kanttouchthis / text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
☆155 Updated last year
ancs21 / awesome-openai-whisper
A curated list of awesome OpenAI's Whisper
☆101 Updated last year
rsxdalv / bark-speaker-directory
Site for sharing Bark voices
☆51 Updated 4 months ago
wsippel / bark_tts
Oobabooga extension for Bark TTS
☆119 Updated last year
Leikoe / torch_to_ggml
convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible
☆15 Updated last year
geekodour / wscribe-editor
web based editor for subtitles and transcripts
☆137 Updated 11 months ago
mateogon / pdf-narrator
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…
☆104 Updated 3 months ago
ChobPT / oobaboogas-webui-langchain_agent
Creates a Langchain Agent which uses the WebUI's API and Wikipedia to work
☆74 Updated last year
Cerlancism / faster-whisper-webui
Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…
☆18 Updated last year
matatonic / openedai-whisper
An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.
☆82 Updated 5 months ago
lee-b / kobold_assistant
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …
☆160 Updated 11 months ago
dynamiccreator / whisper-typer-tool
This is a python script using whisper to type with your voice
☆58 Updated last week
nalbion / whisper-server
streaming speech to text server using Whisper
☆93 Updated 2 years ago
ouoertheo / silero-api-server
☆70 Updated 9 months ago
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆53 Updated 7 months ago
mldljyh / whisper_real_time_translation
The subtitles and translations are generated in real-time and displayed as pop-ups.
☆171 Updated 2 years ago
gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆137 Updated last week
JarodMica / StyleTTS-WebUI
☆67 Updated 4 months ago
VideotronicMaker / LM-Studio-Voice-Conversation
Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…
☆102 Updated last year
BuffMcBigHuge / text-generation-webui-edge-tts
A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.
☆41 Updated last year
NeuralVox / StyleTTS2
☆97 Updated last year
hoof-ai / hoof
"Just hoof it!" - A spotlight like interface to Ollama
☆62 Updated last year
JSchmie / ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆50 Updated 6 months ago
Uminosachi / open-llm-webui
This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).
☆43 Updated 4 months ago