Diamondfan / cassnat_asrView external linksLinks
Implementation of CTC alignment-based single step non-autoregressive transformer
☆13Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for cassnat_asr
Users that are interested in cassnat_asr are comparing it to the libraries listed below
Sorting:
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated last year
- ☆16May 25, 2019Updated 6 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- ☆18Sep 19, 2023Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- mWER loss implementation in tensorflow☆30Sep 7, 2020Updated 5 years ago
- This is now the official location of the Kaldi project.☆24Nov 13, 2019Updated 6 years ago
- ☆67Mar 25, 2022Updated 3 years ago
- PyTorch bindings for Warp-CTC☆42Dec 6, 2019Updated 6 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆149Jan 6, 2020Updated 6 years ago
- Global Open Simulator☆10May 5, 2025Updated 9 months ago
- ☆10Oct 20, 2022Updated 3 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated last week
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- ☆41Jun 25, 2018Updated 7 years ago
- ☆13Oct 3, 2025Updated 4 months ago
- Source code for ACL 2023 paper "End-to-End Simultaneous Speech Translation with Differentiable Segmentation"☆37Dec 6, 2023Updated 2 years ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Greedy Adaptive Dictionary (GAD) is a learning algorithm that sets out to find sparse atoms for speech signals.☆11Oct 1, 2018Updated 7 years ago
- ☆11Dec 28, 2023Updated 2 years ago
- Directional sparse filtering for blind speech separation☆10Jun 8, 2021Updated 4 years ago
- ☆14Nov 26, 2024Updated last year
- Speech Signal Processing project with different types of filters.☆10Aug 7, 2017Updated 8 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- ☆10Oct 25, 2019Updated 6 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated last year
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- An election resource by and for citizens.☆15Jun 9, 2018Updated 7 years ago
- Some PyTorch code for the Kaggle Speech Recognition Challenge☆12Feb 7, 2019Updated 7 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆189Jan 29, 2020Updated 6 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago