Prosodic Speech Segmentation with Transformers
☆26Feb 25, 2024Updated 2 years ago
Alternatives and similar repositories for PSST
Users who are interested in PSST are comparing it to the repositories listed below
- text to speech☆10Mar 19, 2024Updated last year
- ☆22Apr 4, 2023Updated 2 years ago
- Real-time MelGAN running on CPU☆13Dec 3, 2019Updated 6 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- ☆41May 15, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Labeled data for homograph disambiguation☆62Jun 1, 2023Updated 2 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Sep 2, 2022Updated 3 years ago
- Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- ☆10Apr 8, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆147Apr 5, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- Text normalization before TTS: converts digits and letters into Chinese characters☆12Apr 27, 2024Updated last year
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Code repository for FreGrad☆52May 19, 2024Updated last year
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- GPT for FACodec☆13Mar 25, 2024Updated last year
- Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).☆11Apr 14, 2020Updated 5 years ago
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement☆14Apr 10, 2023Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- Manipulating semantic data within Python☆18Jan 14, 2025Updated last year
- ☆55Jan 13, 2023Updated 3 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for editing and controlling the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- Readers that enable reading Kaldi ark files in TensorFlow☆17Mar 7, 2018Updated 7 years ago