oleges1 / quartznet-pytorch
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
☆27Updated 3 years ago
Alternatives and similar repositories for quartznet-pytorch:
Users that are interested in quartznet-pytorch are comparing it to the libraries listed below
- ☆29Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Updated 4 years ago
- ☆25Updated 6 months ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆102Updated 3 years ago
- Implementation of the AlignTTS☆76Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 3 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Clustering-based methods for overlapping diarization☆75Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆27Updated last year
- ☆16Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- The VoxTube dataset official repository☆67Updated last year
- ☆91Updated 3 years ago
- ☆30Updated 2 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆95Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Updated 3 years ago
- Constrained Permutation Invariant Training, Speech Separation☆46Updated 4 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆27Updated 7 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year