MegEngine / End-to-end-ASR-TransformerLinks
An end to end ASR Transformer model training repo
☆13Updated 3 years ago
Alternatives and similar repositories for End-to-end-ASR-Transformer
Users that are interested in End-to-end-ASR-Transformer are comparing it to the libraries listed below
Sorting:
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 6 years ago
- ASR project with pytorch-lightning☆20Updated 4 months ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 3 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- Curriculum Vitae of Quan Wang☆15Updated last month
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- SpeechYOLO Interspeech 2019☆44Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Updated 6 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Updated 2 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆53Updated 4 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Updated 4 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- High performance pytorch modules☆18Updated 2 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆10Updated 4 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Updated 5 years ago
- ☆75Updated 3 years ago
- K-FACE Analysis Project on Pytorch☆11Updated 3 years ago
- A pytorch realization of adafactor (https://arxiv.org/pdf/1804.04235.pdf )☆24Updated 5 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Updated 2 years ago