abus-aikorea / voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
☆374Updated this week
Related projects ⓘ
Alternatives and complementary repositories for voice-pro
- gradio WebUI for AdvancedLivePortrait☆210Updated this week
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆312Updated last month
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆203Updated this week
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆359Updated this week
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆63Updated last month
- ☆220Updated 4 months ago
- Open source inference code for Rev's model☆331Updated 2 weeks ago
- Local SRT/LLM/TTS Voicechat☆535Updated last month
- Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration☆516Updated 3 weeks ago
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆839Updated this week
- ☆486Updated 2 weeks ago
- a comfyui custom node for MimicMotion☆335Updated 3 months ago
- An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos…☆498Updated 4 months ago
- Bring portraits to life via Monitor!☆255Updated 3 months ago
- ☆99Updated last week
- Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and suppo…☆765Updated this week
- Voice Transformation for Videos. 🎤👄🎬☆216Updated last month
- The fastest digital human algorithm, now on your desktop.☆256Updated this week
- ⚡ Insanely fast AI voice assistant with <500ms response times☆302Updated 2 months ago
- Interface for OuteTTS models.☆317Updated this week
- A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them f…☆902Updated last month
- Integrate LLM's into your OS. For any issues or ideas, message us in the discord server below!☆138Updated 2 months ago
- Have a natural voice conversation with an LLM☆222Updated this week
- Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses l…☆335Updated this week
- Unofficial Implementation of Animate Anyone by Novita AI☆751Updated 5 months ago
- EZ-Work AI文 档翻译,人人可用的开源AI文档翻译助手,可以快速低成本调用OpenAI等大语言模型api,帮助您实现txt/markdown/word/csv/excel/pdf/ppt的文档翻译。☆122Updated this week
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆512Updated 2 months ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆945Updated 2 months ago