ishine / ContextNet
TensorFlow 2-based implementation of ContextNet, an improved convolutional RNN-transducer-based architecture for end-to-end speech recognition using global context
☆17Updated 4 years ago
Alternatives and similar repositories for ContextNet
Users who are interested in ContextNet are comparing it to the repositories listed below
- Audio-visual voice activity detection using deep learning☆49Updated 6 years ago
- A simple package for Guided source separation (GSS)☆124Updated last year
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Updated 3 years ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- ☆29Updated 3 years ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆69Updated 4 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]☆27Updated 3 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 3 years ago
- Official repository of our paper: https://arxiv.org/abs/2010.15366☆62Updated 3 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- SpEx+(tied) source code☆86Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆78Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆141Updated 3 weeks ago
- ☆121Updated 3 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- ☆54Updated last year
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated last year
- Alignment files of LibriTTS.☆62Updated 5 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆53Updated last month
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated last year
- ☆56Updated last year
- Streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago
- STOI loss function in PyTorch☆91Updated 8 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆62Updated 2 years ago
- ☆34Updated 4 years ago