ishine / ContextNet
Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recognition using global context
☆17Updated 4 years ago
Alternatives and similar repositories for ContextNet:
Users that are interested in ContextNet are comparing it to the libraries listed below
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆37Updated 3 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 2 months ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- A simple package for Guided source separation (GSS)☆118Updated 10 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆118Updated 2 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated last month
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆103Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆33Updated 4 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆52Updated 3 months ago
- ☆115Updated 3 years ago
- SpEx+(tied) source code☆80Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- ☆50Updated 4 years ago
- PyTorch implementation of RPNSD☆60Updated 9 months ago
- py-webrtcvad wrapper for trimming speech clips☆48Updated 2 years ago
- Conferencing Speech Challenge☆90Updated 3 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- ☆101Updated 4 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 5 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆104Updated last year
- Implementation of audio degradation processes☆101Updated 9 years ago
- ☆29Updated 3 years ago