abus-aikorea / voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
☆866Updated this week
Related projects ⓘ
Alternatives and complementary repositories for voice-pro
- gradio WebUI for AdvancedLivePortrait☆316Updated this week
- Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and suppo…☆969Updated this week
- Synchronized Translation for Videos. Video dubbing☆869Updated 3 weeks ago
- [NeurIPS 24] PromptFix: You Prompt and We Fix the Photo☆506Updated last month
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆315Updated last month
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆854Updated last week
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆363Updated 2 weeks ago
- An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos…☆503Updated 4 months ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆214Updated last week
- Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration☆532Updated last month
- 实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, su…☆413Updated this week
- A chrome extension to easily do visual trials of clothing from any e-commerce store. Here is the easy to use install option 👇☆720Updated this week
- A realtime live transcription and translation app built with Huggingface Transformer.js and Supabase Realtime.☆415Updated last month
- 🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手,无需GPU一键高质量字幕视频合成!支持生成、断句、优化、翻译全流程。让视频字幕制作简单高效!☆556Updated this week
- A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama☆1,375Updated last week
- Voice Transformation for Videos. 🎤👄🎬☆217Updated last month
- ☆499Updated 3 weeks ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆958Updated 2 months ago
- This is a study aim to transfer the single concept by using DIT model self-attention capablity☆285Updated this week
- [NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation☆772Updated 3 weeks ago
- 快速提取音视频内容,整理成一份结构化的markdown笔记☆1,099Updated 3 months ago
- a comfyui custom node for MimicMotion☆339Updated 3 months ago
- A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them f…☆940Updated 2 months ago
- ☆575Updated 7 months ago
- MemFree - Hybrid AI Search Engine & AI Page Generator☆1,046Updated this week
- ⚡ Insanely fast AI voice assistant with <500ms response times☆313Updated 2 months ago
- Local SRT/LLM/TTS Voicechat☆544Updated last month
- ☆1,873Updated 3 months ago
- Awesome Digital Human☆931Updated last week
- face-to-sticker☆623Updated 8 months ago