tts-tutorial / interspeech2022Links

☆163

Alternatives and similar repositories for interspeech2022

Users that are interested in interspeech2022 are comparing it to the libraries listed below

Sorting:

b04901014 / FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆88Updated 3 years ago
tts-tutorial / book
☆63Updated 2 years ago
yistLin / universal-vocoder
A PyTorch implementation of the universal neural vocoder
☆67Updated 4 years ago
Daisyqk / Automatic-Prosody-Annotation
☆111Updated 3 years ago
keonlee9420 / VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆72Updated 4 years ago
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆110Updated last year
tuanh123789 / AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
☆97Updated 3 years ago
AndreevP / wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
☆162Updated 2 years ago
descriptinc / cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆188Updated 2 years ago
howard1337 / S2VC
☆99Updated 4 years ago
gmltmd789 / UnitSpeech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
☆136Updated last year
WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆77Updated last year
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆158Updated 7 months ago
Mikxox / EnCodec_Trainer
☆60Updated 2 years ago
Tomiinek / Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆44Updated 5 years ago
LEEYOONHYUNG / BVAE-TTS
Official implementation of BVAE-TTS
☆173Updated 2 years ago
b04901014 / UUVC
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆81Updated 2 years ago
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆64Updated 5 years ago
unilight / s3prl-vc
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
☆101Updated last year
nii-yamagishilab / mos-finetune-ssl
☆98Updated 2 years ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆81Updated 2 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74Updated 2 years ago
facebookresearch / vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
☆210Updated 2 months ago
keonlee9420 / Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆146Updated 3 years ago
xinjli / alqalign
multilingual speech aligner
☆75Updated last year
lucasnewman / best-rq-pytorch
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
☆122Updated last year
cyhuang-tw / AdaIN-VC
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆117Updated 4 years ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Updated 3 months ago
rishikksh20 / Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆120Updated 3 years ago
neosapience / editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆117Updated 2 years ago