gzhu06 / PodcastFillers_Utils

Utility functions for preprocessing the PodcastFillers dataset

☆9

Alternatives and similar repositories for PodcastFillers_Utils

Users interested in PodcastFillers_Utils are comparing it to the repositories listed below

keonlee9420 / Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆57 Updated 3 years ago
rishikksh20 / Avocodo-pytorch
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆123 Updated 3 years ago
xinjli / alqalign
multilingual speech aligner
☆75 Updated last year
Berkeley-Speech-Group / Speech-Articulatory-Coding
☆43 Updated 2 months ago
dan-wells / fastpitch
NVIDIA's FastPitch, extracted from the DeepLearningExamples repository
☆13 Updated last year
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆64 Updated 5 years ago
spring-media / DeepForcedAligner
☆80 Updated last year
BridgetteSong / ExpressiveTacotron
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74 Updated 2 years ago
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆43 Updated 2 years ago
hcy71o / AutoVocoder
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆70 Updated 2 years ago
neosapience / editts
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆117 Updated 2 years ago
lstrgar / self-supervised-phone-segmentation
Phoneme segmentation using pre-trained speech models
☆55 Updated 2 years ago
CSTR-Edinburgh / qualtreats
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
☆36 Updated last year
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆67 Updated 2 months ago
hs-oh-prml / DiffProsody
☆68 Updated 2 years ago
richardbaihe / a3t
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
☆88 Updated 11 months ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆37 Updated last year
prml-lab-speech-team / demo
☆26 Updated last year
seastar105 / pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆74 Updated last year
interactiveaudiolab / ppgs
High-Fidelity Neural Phonetic Posteriorgrams
☆112 Updated 5 months ago
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆76 Updated 2 years ago
unilight / s3prl-vc
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
☆101 Updated last year
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆78 Updated last year
ndkgit339 / spe-dss
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆44 Updated 2 years ago
b04901014 / FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
☆88 Updated 3 years ago
Daisyqk / Automatic-Prosody-Annotation
☆111 Updated 3 years ago
Dapwner / CVAE-Tacotron
☆24 Updated last year
kamepong / ConvS2S-VC
☆29 Updated 3 years ago
facebookresearch / emphassess
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …
☆23 Updated last year
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆49 Updated last year