🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆14Feb 4, 2026Updated last month
Alternatives and similar repositories for YourTTS
Users that are interested in YourTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆36Jun 6, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Onnx compatible styletts2 code☆17Feb 28, 2026Updated 3 weeks ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- voistock站点voicelist页面免费音源检索并下载程序(可在线体验)☆24May 1, 2024Updated last year
- Official Repository for ICASSP 2024 Paper "SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription"☆28Dec 6, 2024Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆49Jan 19, 2026Updated 2 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆63Feb 19, 2025Updated last year
- ☆11Mar 22, 2023Updated 3 years ago
- Microsoft SurfaceBook 1 Hackintosh with macOS 11 supported, using OpenCore.☆13Jul 17, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 3 months ago
- A real-time voice conversion model based on VITS.☆14Aug 1, 2024Updated last year
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- 干中学|| build_mcp_from_scratch☆26Oct 15, 2025Updated 5 months ago
- Object tracking using OpenCV☆25Feb 17, 2019Updated 7 years ago
- repository for accepted paper in BigData 2022 conference☆12Jul 17, 2023Updated 2 years ago
- used to evaluate wavenet vocoder by rmse f0, MCD, rmse ap...☆15Jan 20, 2020Updated 6 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Jul 3, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- 学习用PyTorch创作唐诗☆17Mar 17, 2019Updated 7 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37Mar 10, 2022Updated 4 years ago
- Megatts2 use HierSpeechpp's vocoder☆18Dec 2, 2024Updated last year
- ☆19Mar 2, 2024Updated 2 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Oct 4, 2022Updated 3 years ago
- Release of the ConditionalQA dataset☆21Nov 2, 2021Updated 4 years ago
- MaskGCT demo page☆14Feb 9, 2025Updated last year
- NVIDIA's TalkNET - Train and Synthesize on colab☆15Dec 6, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- BUPT Software Engineering Homework☆16Jun 14, 2020Updated 5 years ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- a novel framework for stock portfolio trading that employs a 'Relaxation and Refinement' strategy to boost the Soft Actor-Critic (SAC) ag…☆36Mar 25, 2024Updated 2 years ago
- onnxruntime for RaspberryPi armv7l☆23Nov 25, 2021Updated 4 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Sep 21, 2022Updated 3 years ago