spicytigermeat / neuTalk
Open Source Text-to-Speech GUI Tool running on TalkNet
☆11Updated 2 years ago
Alternatives and similar repositories for neuTalk:
Users that are interested in neuTalk are comparing it to the libraries listed below
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin…☆31Updated 5 months ago
- DiffSinger training colab notebook to make training easier hopefully☆38Updated 3 weeks ago
- liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project☆33Updated last year
- Python scripts I made to make NNSVS labeling easier.☆23Updated last year
- Extension program for DIFF-SVC to make it more easy to use☆16Updated 2 years ago
- AudioSR-Colab-Fork☆38Updated last month
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- The Original Support for English NNSVS Dataset Creation☆26Updated 3 months ago
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Real-time end-to-end singing voice convertion☆19Updated 3 months ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- Python script to convert NNSVS DBs to Diffsinger without the NNSVS Python Library☆23Updated 3 months ago
- Misc. tools/scripts that I made to use for tortoise☆22Updated 5 months ago
- a CustomTkInter GUI for processing and training DiffSinger models☆15Updated this week
- Heteronym to Phoneme Parser☆18Updated last year
- UST入りの歌唱DBからENUNU用モデルを生成するツール☆21Updated 2 years ago
- MMD2depth use MikuMikuDance model in Stable Diffusion 2.0 depth2img☆29Updated 2 years ago
- singing voice conversion without f0☆23Updated last year
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".☆16Updated 4 years ago
- Ultimate Vocal Remover CLI type for Google Colab☆50Updated 3 weeks ago
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated 9 months ago
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- TU Darmstadt - Deep Learning: Architectures & Methods Project SS21☆31Updated last month
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 9 months ago
- Official Implementation of StyleTTS-VC☆175Updated last month
- A collection of neural vocoders suitable for singing voice synthesis tasks.☆106Updated 3 weeks ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆121Updated 2 months ago
- ☆28Updated last year