Prosodic Speech Segmentation with Transformers
☆26Feb 25, 2024Updated 2 years ago
Alternatives and similar repositories for PSST
Users that are interested in PSST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- text to speech☆10Mar 19, 2024Updated 2 years ago
- ☆22Apr 4, 2023Updated 3 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Labeled data for homograph disambiguation☆62Jun 1, 2023Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Mar 25, 2026Updated last month
- ☆41May 15, 2023Updated 2 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆21Jan 10, 2022Updated 4 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 3 years ago
- ☆40Jan 24, 2023Updated 3 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆90Jul 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- An R package for implementing and evaluating Maximum Entropy Optimality Theory models☆10Feb 24, 2026Updated 2 months ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 8 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆196Nov 13, 2023Updated 2 years ago
- Unofficial PyTorch Implementation of StarGAN-ZSVC☆14Aug 5, 2021Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Nov 10, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆169Apr 10, 2024Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆151Apr 5, 2024Updated 2 years ago
- readers that enable reading kaldi ark in tensorflow☆17Mar 7, 2018Updated 8 years ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- Framework for training dependency parsing models.☆12Jun 12, 2024Updated last year
- Command line interface from graph rewriting☆10Updated this week
- Code repository for FreGrad☆52May 19, 2024Updated last year