uthree / auris_experimental_vits_dspView external linksLinks
AI based singing voice synthesis
☆37Jun 10, 2024Updated last year
Alternatives and similar repositories for auris_experimental_vits_dsp
Users that are interested in auris_experimental_vits_dsp are comparing it to the libraries listed below
Sorting:
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- ☆15Nov 10, 2025Updated 3 months ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- 💠 Aivis: AI Voice Imitation System☆27Feb 25, 2024Updated last year
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆21May 2, 2023Updated 2 years ago
- 44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆39Jun 2, 2023Updated 2 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- Aivis Voice Model File (.aivm/.aivmx) Generator / Editor☆15Feb 5, 2026Updated last week
- 声質変換 VST☆64Oct 18, 2025Updated 3 months ago
- RVCで音声学習をするための便利スクリプト集☆26Apr 8, 2023Updated 2 years ago
- ☆26Mar 20, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆15Apr 2, 2025Updated 10 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆27Aug 10, 2024Updated last year
- SpeechGateway - A reverse proxy server that enhances speech synthesis with essential, extensible features. 🦉💬☆31Feb 8, 2026Updated last week
- ☆149Sep 8, 2025Updated 5 months ago
- ☆28Oct 28, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 6 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆91Nov 24, 2025Updated 2 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- jp-localization☆48Apr 11, 2023Updated 2 years ago
- This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library☆22Mar 5, 2024Updated last year
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- A vector similarity search engine for humans🥳☆18Oct 30, 2023Updated 2 years ago
- ☆49Jul 22, 2024Updated last year
- ☆33May 1, 2025Updated 9 months ago
- This is a repository for comparing voice changer results and searching datasets and trained models.☆30May 21, 2023Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 4 months ago
- Enhanced Piper TTS with Japanese support, WebAssembly, multi-GPU training, and quality improvements. Features OpenJTalk integration, brow…☆29Updated this week