shivammehta25 / OverFlowView external linksLinks
Putting flows on top of neural transducers for better TTS
☆65Jan 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for OverFlow
Users that are interested in OverFlow are comparing it to the libraries listed below
Sorting:
- Neural HMMs are all you need (for high-quality attention-free TTS)☆163Jan 19, 2026Updated 3 weeks ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- ☆52Jul 16, 2025Updated 6 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Prosody and Pronunciation Modification Network☆62May 5, 2025Updated 9 months ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆26Sep 22, 2022Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Self-supervised Generative LM-based Voice Conversion☆53Apr 24, 2025Updated 9 months ago
- Project of Singing Voice Conversion.☆15Oct 27, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆52Jun 24, 2025Updated 7 months ago
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- ☆259May 15, 2023Updated 2 years ago
- ☆66Aug 16, 2023Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Sep 16, 2022Updated 3 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆46Apr 16, 2023Updated 2 years ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago