burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆215Updated last year
Alternatives and similar repositories for EfficientConformer:
Users that are interested in EfficientConformer are comparing it to the libraries listed below
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆141Updated 2 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆104Updated 3 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆251Updated 2 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Updated 2 years ago
- Towards hot directions in industrial end to end speech recognition☆327Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆141Updated last year
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆130Updated 3 years ago
- 语音识别 论文 前沿☆46Updated 3 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- ☆273Updated 4 years ago
- Example code for a neural transducer model.☆61Updated last year
- CUDA-Warp RNN-Transducer☆212Updated 2 years ago
- PyTorch re-implementation of Speech-Transformer☆101Updated 3 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆70Updated 4 months ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆108Updated 2 years ago
- A CRF-based ASR Toolkit☆332Updated 8 months ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆118Updated 2 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆76Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆165Updated 3 years ago
- ☆68Updated 3 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆375Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆73Updated 2 years ago
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆200Updated 6 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆144Updated 2 years ago
- ☆149Updated 2 years ago
- PyTorch implementation of Densely Connected Time Delay Neural Network☆87Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆197Updated 2 weeks ago