ishine / ContextNet
TensorFlow 2 implementation of ContextNet, an improved convolutional RNN-transducer architecture for end-to-end speech recognition using global context
☆17 Updated 4 years ago
Alternatives and similar repositories for ContextNet
Users who are interested in ContextNet are comparing it to the repositories listed below.
- Speaker embedding (d-vector) trained with GE2E loss ☆284 Updated last year
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT… ☆38 Updated 3 years ago
- TensorFlow implementation of x-vector topology on top of Kaldi recipe ☆119 Updated 5 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification) ☆100 Updated 5 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1 ☆112 Updated 6 years ago
- A PyTorch implementation of End-to-End Neural Diarization ☆108 Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆319 Updated 4 years ago
- End-to-End Neural Diarization ☆406 Updated 4 years ago
- Target speaker extraction and verification for multi-talker speech ☆181 Updated 4 years ago
- Variational Bayes HMM over x-vectors diarization ☆275 Updated last year
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed. ☆148 Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN ☆94 Updated 3 years ago
- Implementation code of non-parallel sequence-to-sequence VC ☆248 Updated 2 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch ☆105 Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning. ☆197 Updated 5 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/… ☆213 Updated 2 years ago
- Speech separation with utterance-level PIT experiments ☆104 Updated 7 years ago
- Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming. ☆283 Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆143 Updated 2 months ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch. ☆136 Updated 5 years ago
- ☆38 Updated 3 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement". ☆266 Updated last month
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆136 Updated last year
- Diarization scoring tools. ☆256 Updated 2 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech… ☆55 Updated 4 years ago
- PyTorch implementation of RPNSD ☆60 Updated last year
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" ☆103 Updated 6 years ago
- ESPnet Model Zoo ☆254 Updated 2 years ago
- ☆104 Updated 4 years ago
- A pure Python module for reading and writing Kaldi ark files ☆260 Updated 5 months ago