GPT for FACodec
☆13Mar 25, 2024Updated 2 years ago
Alternatives and similar repositories for supervoice-gpt-facodec
Users that are interested in supervoice-gpt-facodec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Mar 22, 2024Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- SpeechFlow neural network implementation☆23Aug 8, 2024Updated last year
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 10 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆18May 20, 2025Updated last year
- [Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec☆67Mar 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 4 years ago
- a lightweight voice conversion☆86Feb 25, 2026Updated 3 months ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- VoiceBox neural network implementation☆110Aug 2, 2024Updated last year
- ☆101Jan 19, 2026Updated 4 months ago
- Declare your datasets and download them using a simple tool☆14Aug 2, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- [ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models☆209Jun 25, 2025Updated 11 months ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆78Oct 22, 2024Updated last year
- Training code for FAcodec presented in NaturalSpeech3☆242Aug 26, 2024Updated last year
- ☆39Oct 1, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- PyTorch-based implementations of short-time Fourier transform☆15Jul 21, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official implementation of MelHuBERT☆70Feb 21, 2026Updated 3 months ago
- ☆10Sep 17, 2021Updated 4 years ago
- Deep Multi-Speech model☆11Jul 25, 2018Updated 7 years ago
- ☆28Nov 15, 2023Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆97Jul 4, 2024Updated last year
- ☆69Jul 29, 2023Updated 2 years ago
- ☆18May 14, 2025Updated last year