pselvana / VoiceCrafterLinks
Dockerized Voicecraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆17Updated last year
Alternatives and similar repositories for VoiceCrafter
Users that are interested in VoiceCrafter are comparing it to the libraries listed below
Sorting:
- ☆19Updated 10 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆37Updated 6 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆107Updated last week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆38Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆55Updated last year
- SoTA open-source TTS for Audiobook and Podcast Generation☆170Updated 5 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 2 months ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆32Updated 11 months ago
- Gradio UI for YuE☆78Updated 7 months ago
- A collection of handy helpers for AI art generation, AI writing and other experimental tools☆52Updated last year
- ☆21Updated 7 months ago
- OminiControl for the GPU Poor☆39Updated 9 months ago
- ☆19Updated last year
- Run Stable diffusion 3 on low VRAM systems☆28Updated last year
- ☆14Updated last year
- ☆40Updated last year
- ☆72Updated 8 months ago
- ☆23Updated last year
- ☆17Updated 8 months ago
- ☆16Updated 2 years ago
- Blender add-on for AI generating 2D assets for visualizations, using FLUX and BiRefNet☆56Updated 7 months ago
- ☆14Updated last year
- Cosmos1GP for the GPU Poor by DeepBeepMeep☆81Updated 9 months ago
- ComfyUI integration for Unreal Engine 5☆42Updated 2 months ago
- ComfyUI workflows☆67Updated this week
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆33Updated last month
- An interactive LUT Maker for visual artists☆48Updated 5 months ago
- ☆12Updated last year
- Generate 3D meshes from a single 2D image using TripoSR, complete with manual geometry editing and texture baking support☆56Updated last year
- ToonOut, a fork of BiRefNet focused on background removal for anime images. We open-source our dataset & our weights. See our paper at: h…☆70Updated 2 months ago