☆14Aug 16, 2023Updated 2 years ago
Alternatives and similar repositories for PEL-TTS
Users that are interested in PEL-TTS are comparing it to the libraries listed below
Sorting:
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Aug 29, 2023Updated 2 years ago
- ☆26Jun 5, 2024Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆33Jun 29, 2023Updated 2 years ago
- Knowledge-Infused Topic Model (KITM) for empathetic dialogue generation, integrating topic modeling with commonsense reasoning from COMET…☆22Nov 30, 2025Updated 3 months ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- Code for SRMRL☆19Sep 5, 2021Updated 4 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- ☆19Jul 22, 2022Updated 3 years ago
- ☆19Nov 1, 2021Updated 4 years ago
- ☆20Sep 2, 2021Updated 4 years ago
- ☆19Jan 21, 2022Updated 4 years ago
- ☆22Apr 4, 2023Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- Code for Submission titled "Guidance Learning for Multi-Domain DIalogue Management"☆23Feb 2, 2022Updated 4 years ago
- ☆12Oct 28, 2025Updated 4 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- A Chinese Expressive Long-dialogue Speech Dataset with Scripts☆21Nov 11, 2024Updated last year
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆13Nov 22, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- ☆30Feb 23, 2026Updated last week
- ☆50Aug 16, 2023Updated 2 years ago