TouchSky-Lab / Awesome-Text-to-Speech-TTS
Awesome TTS
☆62Sep 16, 2021Updated 4 years ago
Alternatives and similar repositories for Awesome-Text-to-Speech-TTS
Users that are interested in Awesome-Text-to-Speech-TTS are comparing it to the libraries listed below
- ☆14Aug 19, 2024Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆47Apr 14, 2025Updated 10 months ago
- An implementation of the GlowTTS model. Several modes are added: speaker embedding, prosody encoder (GST), and gradient reversal.☆54Sep 14, 2022Updated 3 years ago
- ☆41May 19, 2023Updated 2 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 4 months ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- ☆10Sep 2, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆32Nov 18, 2025Updated 2 months ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Mar 5, 2022Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- Openfst mirror with some fixes☆14Aug 23, 2024Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆53Jun 29, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Jun 6, 2024Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Jun 16, 2024Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- Multispeaker Community Vocoder Model for DiffSinger☆39Aug 11, 2025Updated 6 months ago
- Kokoro Language Model Training Script for Russian (Ruslan Corpus)☆34Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆13May 17, 2020Updated 5 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- ☆15Apr 2, 2025Updated 10 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Oct 12, 2023Updated 2 years ago
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Finetuning VITS Efficiently☆33Nov 6, 2023Updated 2 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 10 months ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- ☆22Jan 29, 2026Updated 2 weeks ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 5 months ago