Eddycrack864 / UVR5-UI
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
☆203Updated this week
Related projects
Alternatives and complementary repositories for UVR5-UI
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆117Updated 7 months ago
- A toolkit for speaker diarization.☆137Updated 2 weeks ago
- A lightweight end-to-end text-to-speech model☆90Updated last month
- Gradio WebUI for AdvancedLivePortrait☆125Updated this week
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech,…☆304Updated this week
- A diffusers pipeline for zero-shot stylised couples portrait creation☆90Updated last month
- Open source inference code for Rev's model☆329Updated last week
- Voice Transformation for Videos. 🎤👄🎬☆216Updated 3 weeks ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆359Updated this week
- ☆98Updated last week
- Interface for OuteTTS models.☆277Updated this week
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆61Updated last month
- ☆220Updated 4 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝☆454Updated 3 months ago
- SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild☆88Updated 8 months ago
- Chinese-language WebUI integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added☆79Updated 6 months ago
- Add caption to any video☆174Updated 9 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆233Updated 3 weeks ago
- Bring portraits to life via Monitor!☆255Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆249Updated 2 months ago
- The fastest digital human algorithm, now on your desktop.☆248Updated this week
- Video-to-video translation with voice cloning and lip synchronization; supports translation between Chinese and English☆109Updated 6 months ago
- DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyn…☆312Updated last week
- Turn anyone into another image☆224Updated 7 months ago
- Common tasks in a single model☆33Updated 9 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆55Updated this week
- Separate stems (vocals, bass, drums, other) from audio. Recombine, tempo match, slice/crop audio☆163Updated 2 months ago
- Have a natural voice conversation with an LLM☆222Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆126Updated 2 months ago
- AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆53Updated this week