MegEngine / End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for End-to-end-ASR-Transformer
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 3 years ago
- Curriculum Vitae of Quan Wang☆14Updated this week
- Anonymous ICLR Submission☆14Updated 5 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆24Updated 2 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Updated 3 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- Example implementation of Monotonic Chunkwise Attention.☆50Updated 6 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- Implementaion RNN tranceducer☆21Updated 5 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago