Mu-Y / DiariST
☆19Updated last year
Alternatives and similar repositories for DiariST
Users who are interested in DiariST are comparing it to the repositories listed below
- ☆28Updated 5 months ago
- Unofficial implementation of wavenext vocoder☆48Updated 10 months ago
- ☆28Updated last month
- Just another FastSpeech 2 but cleaner code :)☆26Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆27Updated 2 months ago
- ☆39Updated 9 months ago
- ☆34Updated last year
- (WIP) Long-form speech generation☆31Updated 3 months ago
- faster inference☆28Updated 5 months ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆23Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆27Updated 9 months ago
- ☆25Updated 8 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆57Updated 2 weeks ago
- ☆19Updated last year
- Streaming Vocos☆28Updated last month
- ☆12Updated 5 months ago
- ☆18Updated 10 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated 2 years ago
- ☆28Updated last week
- ☆68Updated 10 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆70Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Updated last year
- multilingual speech aligner☆74Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆33Updated last year
- Megatts2 using HierSpeechpp's vocoder☆18Updated 7 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆52Updated 11 months ago
- ☆75Updated last week