AndreMarkert / whisper-webui
A browser interface based on the Gradio library for OpenAI's Whisper model.
☆43Updated 2 years ago
Alternatives and similar repositories for whisper-webui
Users who are interested in whisper-webui are comparing it to the repositories listed below.
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- web based editor for subtitles and transcripts☆142Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Oobabooga extension for Bark TTS☆119Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆278Updated last week
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- A curated list of awesome OpenAI's Whisper☆99Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆162Updated this week
- Site for sharing Bark voices☆51Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆62Updated 11 months ago
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆159Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆42Updated last year
- ☆72Updated 4 months ago
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆43Updated 8 months ago
- Coqui AI TTS plugin☆85Updated 5 months ago
- A Chrome extension for querying a local LLM model using llama-cpp-python, includes a pip package for running the server, 'pip install loca…☆18Updated 2 years ago
- A simple TTS server for generating speech using StyleTTS2☆37Updated last year
- OpenAI Whisper API-style local server, running on FastAPI☆87Updated 2 months ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.☆64Updated 2 months ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Updated 2 years ago
- A modern GUI application that transcribes and translates audio and video files, offering the option to save the subtitles as separate fil…☆15Updated 2 years ago
- Rivet plugin for integration with Ollama, the tool for running LLMs locally easily☆43Updated 6 months ago
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆96Updated 9 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- Audio datasets, easier.☆86Updated 2 years ago