effusiveperiscope / so-vits-svcView external linksLinks
so-vits-svc
☆175Oct 20, 2025Updated 3 months ago
Alternatives and similar repositories for so-vits-svc
Users that are interested in so-vits-svc are comparing it to the libraries listed below
Sorting:
- Colaboratory Notebook for Ultimate Vocal Remover☆102Jan 16, 2026Updated last month
- Core Engine of Singing Voice Conversion & Singing Voice Clone☆2,847Apr 23, 2024Updated last year
- Platform and API Agnostic library for powering chatbots☆24Feb 27, 2023Updated 2 years ago
- discord bot using AI to generate images based on discord messages☆11Oct 10, 2023Updated 2 years ago
- so-vits-svc fork with realtime support, improved interface and more features.☆9,267Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- Sentencepiece based BPE tokenizer for English and Japanese language text.☆28Apr 4, 2024Updated last year
- ☆46Jul 23, 2025Updated 6 months ago
- ☆16Dec 12, 2023Updated 2 years ago
- A simple GUI application that slices audio with silence detection☆1,437Jul 29, 2024Updated last year
- ☆19Jul 31, 2024Updated last year
- This is a HeadSwap project not only face☆34Dec 28, 2022Updated 3 years ago
- Home of the Chunkmogrify project☆16Jan 11, 2022Updated 4 years ago
- A chat-like interface for Stable Diffusion☆48Dec 30, 2022Updated 3 years ago
- vits☆21Mar 29, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Forked from https://huggingface.co/spaces/aadnk/faster-whisper-webui CLI to support running both transcribe and translate tasks or differ…☆18Dec 30, 2023Updated 2 years ago
- GPT2 Byte Pair Encoding implementation in Golang☆24Jul 9, 2025Updated 7 months ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆46Aug 14, 2023Updated 2 years ago
- A re-implementation of Stable-Diffusion using better code pratices with faster and lower-memory usage.☆45Feb 8, 2023Updated 3 years ago
- A TriposR implementation for WebUI☆60Mar 13, 2024Updated last year
- ☆19Feb 27, 2023Updated 2 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- ☆32Apr 10, 2023Updated 2 years ago
- ☆28Nov 15, 2023Updated 2 years ago
- Stability.AI Model Metadata Standard Specification☆70Jun 4, 2024Updated last year
- Fast finetuning using a booster model that puts the initial state to a local minimum☆113Aug 29, 2023Updated 2 years ago
- The homepage of LongCat-Video-Avatar☆141Dec 18, 2025Updated last month
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆26Oct 9, 2021Updated 4 years ago
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆34Dec 15, 2024Updated last year
- GUI Wrapper for 'A TensorFlow Implementation of DC-TTS: yet another text-to-speech model'☆26Jul 16, 2020Updated 5 years ago
- [ECCV2024 offical]KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding☆34Jul 12, 2024Updated last year
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- ☆27Mar 30, 2023Updated 2 years ago
- Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch☆33Dec 15, 2023Updated 2 years ago
- An extension to allow managing custom depth inputs to Stable Diffusion depth2img models for the stable-diffusion-webui repo.☆72Feb 4, 2023Updated 3 years ago
- This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.☆92Jan 25, 2026Updated 3 weeks ago
- Tools to train a generative model on arbitrary audio samples☆1,111Apr 29, 2024Updated last year