MegEngine / End-to-end-ASR-TransformerLinks
An end to end ASR Transformer model training repo
☆13Updated 3 years ago
Alternatives and similar repositories for End-to-end-ASR-Transformer
Users that are interested in End-to-end-ASR-Transformer are comparing it to the libraries listed below
Sorting:
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- ASR project with pytorch-lightning☆20Updated 5 months ago
- Curriculum Vitae of Quan Wang☆15Updated 2 months ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 6 years ago
- Local Attention - Flax module for Jax☆22Updated 4 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- Implementation of Multistream Transformers in Pytorch☆54Updated 4 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
- bumble bee transformer☆14Updated 4 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".☆44Updated 3 years ago
- Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.☆23Updated 5 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch☆53Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆76Updated 2 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- SpeechYOLO Interspeech 2019☆44Updated 3 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37Updated last year
- PyTorch re-implementation of Speech-Transformer☆102Updated 3 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Updated 6 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Updated 2 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- neural network based speaker embedder☆25Updated 2 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆10Updated 4 years ago