oleges1 / quartznet-pytorch
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
☆27Updated 3 years ago
Alternatives and similar repositories for quartznet-pytorch
Users who are interested in quartznet-pytorch are comparing it to the repositories listed below
- QuartzNet automatic speech recognition (ASR) model trained on English CommonVoice, implemented in PyTorch with CTC loss and beam search.☆16Updated 4 years ago
- TensorFlow 2-based implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recogni…☆17Updated 4 years ago
- ☆37Updated 2 months ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- The VoxTube dataset official repository☆69Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training☆40Updated 4 years ago
- ☆29Updated 3 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 3 years ago
- Convert Kaldi feature extraction and nnet3 models into TensorFlow Lite models. Currently aimed at converting Kaldi's x-vector models and …☆20Updated 2 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Updated 3 years ago
- A simple package for Guided source separation (GSS)☆124Updated last year
- ☆25Updated 10 months ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆118Updated last year
- Linear prediction coefficient estimation from a mel-spectrogram, implemented in Python using the Levinson-Durbin algorithm.☆69Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- ☆59Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- ☆17Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 3 years ago
- A curated list of speaker-embedding, speaker-verification, and speaker-identification resources.☆49Updated 3 years ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Updated 3 years ago
- ☆26Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- ☆54Updated last year
- ☆41Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- A PyTorch implementation of FB-MelGAN☆89Updated 5 years ago