prosodylab / prosodylab.dictionaries
A repository for dictionaries to be used with the Prosodylab-Aligner
☆17Updated 10 years ago
Alternatives and similar repositories for prosodylab.dictionaries:
Users that are interested in prosodylab.dictionaries are comparing it to the libraries listed below
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Updated 4 years ago
- ☆40Updated 3 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- ☆26Updated 3 years ago
- ☆12Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Google's TPGST reimplementation.☆34Updated 5 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- Easier analysis of large speech corpora☆22Updated 3 years ago
- ☆51Updated 6 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- This repository contains laughter-related synthesis systems.☆13Updated 4 years ago
- Long audio alignment using Kaldi☆24Updated 3 years ago
- A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default☆10Updated 10 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 8 months ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆23Updated 4 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆28Updated 10 months ago
- ☆45Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- ☆25Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- ☆15Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago