Edresson / Coqui-TTSView external linksLinks
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
β37Mar 10, 2022Updated 3 years ago
Alternatives and similar repositories for Coqui-TTS
Users that are interested in Coqui-TTS are comparing it to the libraries listed below
Sorting:
- Pythonηι³ι’ε·₯ε ·β16Dec 5, 2025Updated 2 months ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",β¦β80May 29, 2023Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speechβ50Jul 16, 2022Updated 3 years ago
- torch version of LPCNetβ22Jul 8, 2020Updated 5 years ago
- β22Apr 4, 2023Updated 2 years ago
- β24Mar 15, 2022Updated 3 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singingβ26Aug 30, 2024Updated last year
- An unofficial PyTorch implementation of "HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversariβ¦β24Feb 5, 2021Updated 5 years ago
- β64Jan 15, 2024Updated 2 years ago
- TTS Text Analyzerβ32Jul 20, 2023Updated 2 years ago
- This is Pytorch Implementation of Google's Non-attentive Tacotron.β57Dec 21, 2022Updated 3 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.β194Jun 8, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processingβ71Dec 2, 2022Updated 3 years ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"β365Aug 3, 2023Updated 2 years ago
- β26Sep 22, 2022Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.β71Mar 19, 2021Updated 4 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.β29Mar 14, 2019Updated 6 years ago
- PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesisβ73Aug 3, 2021Updated 4 years ago
- The code for aishell-3 baseline acoustic modelβ69Nov 30, 2020Updated 5 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systemsβ39Nov 1, 2023Updated 2 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneβ1,054Nov 4, 2024Updated last year
- Virtual news production using Tacotron2 and Wav2Lipβ11Nov 14, 2023Updated 2 years ago
- An ambient noise detectorβ10Aug 23, 2020Updated 5 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Chaβ¦β11Apr 16, 2021Updated 4 years ago
- PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer β¦β39May 16, 2021Updated 4 years ago
- The Official Implementation of βContent-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthβ¦β87Dec 20, 2022Updated 3 years ago
- since some people are selling that for $25 I decided to make one myself rq since it's really easy and people shouldn't waste their money β¦β11Feb 2, 2021Updated 5 years ago
- β12Jun 29, 2025Updated 7 months ago
- VS Code tools for NextBASICβ12Apr 22, 2025Updated 9 months ago
- automatic music transcription application written in javaβ12Jan 13, 2013Updated 13 years ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agentsβ¦β10Dec 12, 2024Updated last year
- A codebase for data crawling and preprocessing for TTS and ASR systems training.β22Feb 5, 2026Updated last week
- AI-powered knowledge management without vector embeddings. Built upon Claude Agent SDK, File system based, Agent driven. Maybe slower, buβ¦β88Feb 1, 2026Updated last week
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"β12Mar 9, 2024Updated last year
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactioβ¦β11Jan 23, 2026Updated 3 weeks ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workloadβ¦β24Oct 19, 2025Updated 3 months ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.β18Oct 16, 2025Updated 3 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRLβ101Jun 26, 2024Updated last year
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Networkβ45Dec 1, 2021Updated 4 years ago