MegEngine / End-to-end-ASR-TransformerLinks
An end to end ASR Transformer model training repo
☆13Updated 4 years ago
Alternatives and similar repositories for End-to-end-ASR-Transformer
Users that are interested in End-to-end-ASR-Transformer are comparing it to the libraries listed below
Sorting:
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆32Updated 6 years ago
- ASR project with pytorch-lightning☆20Updated 9 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- A handwritten Chemical Structure Image data set named EDU-CHEMC, which consists of totally 52,987 handwritten molecular structure images …☆14Updated 8 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- Curriculum Vitae of Quan Wang☆15Updated last month
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Updated 5 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Updated 3 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Updated 6 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆24Updated 6 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Updated 3 years ago
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Updated 4 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Comprehensive Python library for speech and voice.☆32Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Anonymous ICLR Submission☆14Updated 6 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Updated 3 years ago