upskyy / ContextNetLinks

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

☆38

Alternatives and similar repositories for ContextNet

Users that are interested in ContextNet are comparing it to the libraries listed below

Sorting:

sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer(RNN-T).
☆78Updated 4 years ago
voithru / voice-activity-detection
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
☆156Updated 3 years ago
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆143Updated 2 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 4 years ago
JoungheeKim / Non-Attentive-Tacotron
This is Pytorch Implementation of Google's Non-attentive Tacotron.
☆57Updated 2 years ago
sooftware / jasper
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32Updated 4 years ago
idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆20Updated last year
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
nii-yamagishilab / Attention_Backend_for_ASV
Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances
☆50Updated 2 years ago
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆108Updated 2 years ago
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆111Updated 2 years ago
skgusrb12 / voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26Updated 4 years ago
seongmin-kye / meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆74Updated 4 years ago
sooftware / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
☆35Updated 3 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
☆26Updated last year
lingjzhu / clap-ipa
Keyword spotting and forced alignment in any language
☆63Updated 3 weeks ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆65Updated last year
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216Updated 2 years ago
nikvaessen / w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆144Updated 3 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Updated 2 years ago
mispchallenge / misp2022_baseline
☆30Updated 2 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
tts-tutorial / interspeech2022
☆163Updated 2 years ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33Updated 3 years ago
pashanitw / W2V2-BERT-ASR-Training
☆16Updated last year
hechmik / voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
☆68Updated 3 years ago
dmlguq456 / NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
☆75Updated 9 months ago