tiefenauer / forced-alignmentLinks
Forced alignment based on speech pauses using an RNN
☆9Updated 5 years ago
Alternatives and similar repositories for forced-alignment
Users that are interested in forced-alignment are comparing it to the libraries listed below
Sorting:
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆14Updated 6 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- A handy dataset of noises for ASR☆21Updated 6 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Updated 4 years ago
- ☆64Updated 3 years ago
- ☆56Updated 2 years ago
- Text frontend for ESPnet tts recipes☆31Updated 4 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- ☆26Updated 4 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Deep Speech Distances PyTorch☆28Updated 3 years ago
- A pytorch implementation of FFTNet.☆37Updated 6 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Updated 4 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆31Updated 2 years ago
- ☆32Updated 3 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Prosody and Pronunciation Modification Network☆54Updated last month
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- Viterbi decoding in PyTorch☆34Updated 3 weeks ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 3 years ago
- with alignment learning and continuous wavelet transform☆21Updated 2 years ago
- ☆42Updated 3 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago