ex3ndr / supervoice-gpt-facodecView external linksLinks
GPT for FACodec
☆13Mar 25, 2024Updated last year
Alternatives and similar repositories for supervoice-gpt-facodec
Users that are interested in supervoice-gpt-facodec are comparing it to the libraries listed below
Sorting:
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- ☆25Mar 6, 2024Updated last year
- My vocoder experiments☆31Jul 26, 2025Updated 6 months ago
- SpeechFlow neural network implementation☆22Aug 8, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- ☆10Sep 17, 2021Updated 4 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆10Apr 8, 2024Updated last year
- Implementation for NATv2.☆23Feb 20, 2021Updated 4 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Oct 31, 2023Updated 2 years ago
- a lightweight voice conversion☆86Sep 2, 2024Updated last year
- Declare your datasets and download them using a simple tool☆14Aug 2, 2024Updated last year
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- PyTorch-based implementations of short-time Fourier transform☆15Jul 21, 2025Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- VoiceLDM: Text-to-Speech with Environmental Context☆191Aug 9, 2024Updated last year
- Official implementation of MelHuBERT☆68Oct 26, 2024Updated last year
- ☆13Aug 11, 2018Updated 7 years ago
- ☆15Oct 11, 2019Updated 6 years ago
- ☆99Jan 19, 2026Updated 3 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 8 months ago
- RND1: Scaling Diffusion Language Models☆173Jan 12, 2026Updated last month
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago