litagin02 / Style-Bert-VITS2-EditorView external linksLinks
☆27Aug 10, 2024Updated last year
Alternatives and similar repositories for Style-Bert-VITS2-Editor
Users that are interested in Style-Bert-VITS2-Editor are comparing it to the libraries listed below
Sorting:
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Real-time end-to-end singing voice convertion☆23Nov 3, 2024Updated last year
- ☆15Nov 10, 2025Updated 3 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- 大量の音声データから笑い声部分を集めるやつ☆12May 23, 2024Updated last year
- ☆28Nov 15, 2023Updated 2 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆31Feb 7, 2026Updated last week
- source code of EfficientTTS 2☆20Feb 18, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- AI based singing voice synthesis☆37Jun 10, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- 日本語TTS(VITS)の学習と音声合成のGradio WebUI☆42Jan 5, 2024Updated 2 years ago
- An environment where you can try out faster-whisper immediately.☆38Nov 21, 2024Updated last year
- Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.☆1,164Dec 7, 2025Updated 2 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆24Sep 1, 2023Updated 2 years ago
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference☆24May 28, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- A Grapheme to Phoneme model using LSTM implemented in pytorch☆13Jul 6, 2022Updated 3 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago