AI-based singing voice synthesis
☆37Jun 10, 2024Updated last year
Alternatives and similar repositories for auris_experimental_vits_dsp
Users who are interested in auris_experimental_vits_dsp are comparing it to the libraries listed below.
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- Real-time end-to-end singing voice conversion☆25Nov 3, 2024Updated last year
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆55Sep 25, 2023Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, adapted to support 44100 Hz Japanese HuBERT.☆16May 21, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, adapted to support 44100 Hz Japanese audio sources.☆21May 2, 2023Updated 3 years ago
- Voice conversion VST☆70Oct 18, 2025Updated 6 months ago
- ☆15Nov 10, 2025Updated 5 months ago
- ☆28Oct 28, 2023Updated 2 years ago
- A collection of handy scripts for voice training with RVC☆26Apr 8, 2023Updated 3 years ago
- Adapted to support 44100 Hz Japanese audio sources: MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆39Jun 2, 2023Updated 2 years ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated last year
- Aivis Voice Model File (.aivm/.aivmx) Generator / Editor☆15Feb 5, 2026Updated 3 months ago
- 💠 Aivis: AI Voice Imitation System☆27Feb 25, 2024Updated 2 years ago
- ☆36May 1, 2025Updated last year
- ☆149Sep 8, 2025Updated 7 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆15Apr 2, 2025Updated last year
- An unofficial vits2-TTS implementation in PyTorch, adapted to support 44100 Hz Japanese audio sources.☆24Sep 1, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Aivis Voice Model File (.aivm/.aivmx) Utility Library☆25Oct 17, 2025Updated 6 months ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- jp-localization☆48Apr 11, 2023Updated 3 years ago
- Infer only tts☆47Jan 6, 2026Updated 4 months ago
- a Frontier Japanese Speech Generation net☆64May 15, 2025Updated 11 months ago
- ☆15Nov 11, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- A vector similarity search engine for humans🥳☆18Oct 30, 2023Updated 2 years ago
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆16Sep 7, 2025Updated 7 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆65Sep 22, 2025Updated 7 months ago
- Official Repository of UltraVoice☆61Oct 28, 2025Updated 6 months ago
- ☆40Jul 15, 2025Updated 9 months ago
- Gradio WebUI for training and speech synthesis with Japanese TTS (VITS)☆42Jan 5, 2024Updated 2 years ago
- This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library☆22Mar 5, 2024Updated 2 years ago
- A real-time software for turn-taking, backchannel, and head-nodding prediction☆94Updated this week
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- SpeechGateway - A reverse proxy server that enhances speech synthesis with essential, extensible features. 🦉💬☆32Feb 8, 2026Updated 2 months ago