MegEngine / End-to-end-ASR-Transformer
An end to end ASR Transformer model training repo
☆13Updated 3 years ago
Alternatives and similar repositories for End-to-end-ASR-Transformer:
Users that are interested in End-to-end-ASR-Transformer are comparing it to the libraries listed below
- ASR project with pytorch-lightning☆20Updated this week
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- Curriculum Vitae of Quan Wang☆15Updated 2 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- ☆25Updated 2 years ago
- bumble bee transformer☆14Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 7 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆25Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- ☆12Updated last year
- ☆29Updated 4 years ago
- Implementaion RNN tranceducer☆22Updated 5 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆52Updated 2 years ago
- The History of Speech Recognition to the Year 2030☆12Updated 3 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆70Updated 2 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆76Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆23Updated 2 years ago