MegEngine / End-to-end-ASR-TransformerLinks
An end to end ASR Transformer model training repo
☆13Updated 4 years ago
Alternatives and similar repositories for End-to-end-ASR-Transformer
Users that are interested in End-to-end-ASR-Transformer are comparing it to the libraries listed below
Sorting:
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- ASR project with pytorch-lightning☆20Updated 8 months ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆32Updated 6 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …☆14Updated 7 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- Curriculum Vitae of Quan Wang☆15Updated 2 months ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Updated 4 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 4 years ago
- 逻辑回归和单层softmax的解析解☆12Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆10Updated 4 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Updated 6 years ago
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Updated 4 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆54Updated 4 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- ☆12Updated 5 years ago
- PyTorch CTC Decoder bindings☆14Updated 8 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 4 years ago
- Anonymous ICLR Submission☆14Updated 6 years ago
- Comprehensive Python library for speech and voice.☆32Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Updated 4 years ago